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Introduction 


The Lorentz group is essentially the symmetry group of special relativity. It is commonly 
defined as a set of (linear) transformations acting on a four-dimensional vector space 

representing changes of inertial frames in Minkowski space-time. But as we will see 
below, one can exhibit an isomorphism between the Lorentz group and the group of con¬ 
formal transformations of the sphere the latter is of course two-dimensional. This 
isomorphism thus relates the action of a group on a /onr-dimensional space to its action 
on a fwo-dimensional manifold. At first sight, such a relation seems surprising: loosely 
speaking, one expects to have lost some information in going from four to two dimensions. 
In particular, the isomorphism looks like a coincidence of the group structure: there is no 
obvious geometric relation between the original four-dimensional space on the one hand, 
and the sphere on the other hand. 

The purpose of these notes is to show that such a relation actually exists, and is 
even quite natural. Indeed, by defining a notion of “celestial spheres”, one can derive a 
direct link between four-dimensional Minkowski space and the two-dimensional sphere. 
In short, the celestial sphere of an inertial observer in Minkowski space is the sphere of all 
directions towards which the observer can look, and coordinate transformations between 
inertial observers (he. Lorentz transformations) correspond to conformal transformations 
of this sphere [1-4]. In this work we will review this construction in a self-contained way. 

Keeping this motivation in mind, the text is organized as follows. In section 1, we re¬ 
view the basic principles of special relativity and define the natural symmetry groups 
that follow, namely the Poincare group and its homogeneous subgroup, the Lorentz 
group [5-7]. This will also be an excuse to discuss certain elegant properties of the Lorentz 
group that are seldom exposed in elementary courses on special relativity, in particular 
regarding the physical meaning of the notion of “rapidity” [8-10]. In section 2, we then 
establish the isomorphism between the connected Lorentz group and the group SL(2, C) 
of complex, two by two matrices of unit determinant, quotiented by its center Z 2 . We also 
derive the analogue of this result in three space-time dimensions. Section 3 is devoted 
to the construction of conformal transformations of the sphere; it is shown, in particular, 
that such transformations span a group isomorphic to SL(2, C)/Z 2 — a key result in the 
realm of two-dimensional conformal field theories [11,12]. At that point, the stage will 
be set for the final link between the Lorentz group and the sphere, which is established 
in section 4. The conclusion, section 5, relates these observations to some recent develop¬ 
ments in quantum gravity — in particular BMS symmetry [13-20] and holography [21-25]. 

The presentation voluntarily starts with fairly elementary considerations, in order to 
be accessible (hopefully) to undergraduate students. Though some basic knowledge of 
group theory and special relativity should come in handy, no prior knowledge of differ¬ 
ential geometry, general relativity or conformal field theory is assumed. In particular, 
sections 1 and 2 are mostly based on the undergraduate-level lecture notes [5]. 


1 Special relativity and the Lorentz group 

In this section, after reviewing the basic principles of special relativity (subsection 1.1), 
we define the associated symmetry groups (subsection 1.2) and introduce in particular 
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the Lorentz group. In subsection 1.3, we then define certain natural subgroups of the 
latter. Subsection 1.4 is devoted to the notion of Lorentz boosts and to the associated 
additive parameter, which turns out to have the physical meaning of “rapidity”. Finally, 
in subsection 1.5 we show that any Lorentz transformation preserving the orientation of 
space and the direction of time flow can be written as the product of two rotations and 
a boost, and then use this result in subsection 1.6 to classify the connected components 
of the Lorentz group. All these results are well known; the acquainted reader may safely 
jump directly to section 2. The presentation of this section is mainly inspired from the 
lecture notes [5] and [6]; more specialized references will be cited in due time. 

1.1 The principles of special relativity 

1.1.1 Events and reference frames 

In special relativity, natural phenomena take place in the arena of space-time. The latter 
consists of points, called events, which occur at some position in space, at some moment 
in time. Events are seen by observers who use coordinate systems, also called reference 
frames, to specify the location of an event in space-time. In the realm of special relativity, 
reference frames typically consist of three orthonormal spatial coordinates^ {x^,x‘^,x^) 
and one time coordinate t, measured by a clock carried by the observer. For practical 
purposes, the speed of light in the vacuum, 

c = 299 792 458 m/s, (1.1) 

is used as a conversion factor to express time as a quantity with dimensions of distance. 
This is done by defining a new time coordinate = ct. 

Thus, in a given reference frame, an event occurring in space-time is labelled by its four 
coordinates (a;°, x^,x‘^, x^), collectively denoted as (x^). (From now one, greek indices run 
over the values 0, 1, 2, 3.) Of course, the event’s existence is independent of the observers 
who see it, but its coordinates are not: if Alice and Bob are two observers looking at the 
same event, Alice may use a set of four numbers (x^) to describe its location, but Bob will 
in general use different coordinates (x'^) to locate the same event. Besides, if we do not 
specify further the relation between Alice and Bob, there is no link whatsoever between 
the coordinates they use. What we need are restrictions on the possible reference frames 
used by Alice and Bob; the principles of special relativity will then apply only to those 
observers whose reference frames satisfy the given restrictions. 

Whe presentation here is confined to four-dimensional space-times, but the generalization to d- 
dimensional space-times is straightforward: simply take spatial coordinates {x^, ...,x‘^~^). 
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Figure 1; Alice (drawn in black) and Bob (drawn in red) in space-time, with their respec¬ 
tive reference frames (including clocks); for simplicity, only two spatial dimensions are 
represented. Looking at the same event, Alice and Bob will typically associate different 
coordinates with it. In general, there is no way to relate the coordinates of Alice’s frame 
to those of Bob’s frame. 


1.1.2 The principles of special relativity 

We now state the defining assumptions of special relativity. The three first basic assump¬ 
tions are homogeneity of space-time, isotropy of space and causality [7,8]. The remaining 
principles, discussed below, lead to constraints on the relation between reference frames. 
To expose these principles, we first need to define the notion of inertial frames. 

According to the principle of inertia^ a body left to itself, without forces acting on it, 
should move in space in a constant direction, with a constant velocity. Obviously, this 
principle cannot hold in all reference frames. For example, suppose Alice observes that 
the principle of inertia is true in her reference frame (for example by throwing tennis balls 
in space and observing that they move in straight lines at constant velocity). Then, if 
Bob is accelerated with respect to Alice, he will naturally use a comoving frame and the 
straight motions seen by Alice will become curved motions in his reference frame. There¬ 
fore, if two reference frames are accelerated with respect to each other, the principle of 
inertia cannot hold in both frames. More generally, we call inertial frame a reference 
frame in which the principle of inertia holds [7]; the results of special relativity apply 
only to such frames. Accordingly, an observer using an inertial frame is called an inertial 
observer; in physical terms, it is an observer falling freely in empty space. We will see 
in subsection 1.2 what restrictions are imposed on the relation between coordinates of 
inertial frames; at present, we already know that, if two such frames move with respect 
to each other, then this motion must take place along a straight line, at constant velocity. 

Given this definition, we are in position to state the two crucial defining principles 
of special relativity. The first, giving its name to the theory, is the principle of relativity 


5 




(in the restricted sense [26]), which states that the laws of Nature must take the same 
form in all inertial frames. In other words, according to this principle, there exists no 
privileged inertial frame in the Universe: there is no experiment that would allow an 
experimenter to distinguish a given inertial frame from the others. This is a principle of 
special relativity in that it only applies to inertial frames; a principle of general relativity 
would apply to all possible reference frames, inertial or not. The latter principle leads to 
the theory of general relativity, which we will not discuss further here. 

The second principle is Einstein’s historical “second postulate”, which states that the 
speed of light in the vacuum takes the same value c, written in (1.1), in all inertial frames. 
In fact, if one assumes that Maxwell’s theory of electromagnetism holds, then the second 
postulate is a consequence of the principle of relativity. Indeed, saying that the speed of 
light (in the vacuum) is the same in all inertial frames is really saying that the laws of 
electromagnetism are identical in all inertial frames [7]. 

1.2 The Poincare group and the Lorentz group 

We now work out the relation between coordinates of inertial frames; the set of all such 
relations will form a group, called the Poincare group. We will see that the second 
postulate is crucial in determining the form of this group, through the notion of “space- 
time intervaf’. 


1.2.1 Linear structure 


Suppose A and B are two inertial frames, i.e. the principle of inertia holds in both of 
them. Then, a particle moving along a straight line at constant velocity, as seen from 
A, must also move at constant velocity along a straight line when seen from B. Thus, 
calling (x^) the space-time coordinates of A and {x'^) those of B, the relation between 
these coordinates must be such that any straight line in the coordinates x^ is mapped 
on a straight line in the coordinates x'^. The most general transformation satisfying this 
property is a projective map [6], for which 


h -F c^x^^ 


yp = 0 , 1 , 2 , 3 . 


( 1 . 2 ) 


(From now on, summation over repeated indices will always be understood.) Here a^, 
b and are constant coefficients. If we insist that points having finite values of 
coordinates in A remain with finite coordinates in H, we must set = 0. Then, absorbing 
the constant b in the parameters and the transformation (1.2) reduces to 


+ a>^. 


(1.3) 


Thus, the principle of inertia endows space-time with a linear structure. In order for the 
transformation (1.3) to be invertible, we must also demand that the matrix A = {A^^) 
be invertible. Apart from that, using only the principle of inertia, we cannot go further 
at this point. The principle of relativity will set additional restrictions on A. 


1.2.2 luvariauce of the interval 

Let again A be an inertial frame and let V and Q be two events in space-time. Call 
Ax^ the components of the vector going from P to Q in the frame A. Then, we call the 
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number 

As^ = —(Ax°)^ + (Ax^)^ + (Ax^y + (Ax^)^ = rj^yAx^Ax'" (1.4) 

the square of the interval between V and Q. The matrix 


V = 


/-I 0 0 0\ 
0 10 0 
0 0 10 
\ 0 0 0 1 / 


(1.5) 


appearing in this dehnition is called the Minkowski metric matrix. The terminology 
associated with dehnition (1.4) may seem inconsistent, in that we call “square of the 
interval between two events” a quantity that seems to depend not only on the events, but 
also on the coordinates chosen to locate them (in the present case, the separation Ax^). 
This is not the case, however, thanks to the following important result: 


Proposition. Let A and B be two inertial frames, V and Q two events, the square of 
the interval between them being As^ in the coordinates of A and As'^ in those of B. 
Then, 

As^ = As^^ (“invariance of the interval”). (1-6) 

In other words, the number (1.4) does not depend on the inertial coordinates used to 
dehne it. 

Proof. First suppose that V and Q are light-like separated, i.e. that there exists a light 
ray going from V to Q (or from Q to V). Then, As^ = 0 by construction. But, by 
Einstein’s second postulate, the speed of light is the same in both reference frames A and 
B, so As'^ = 0 as well. Thus, 


As^ = 0 iff As'2 = 0. 

Now, since A and B are inertial frames, the relation between their coordinates must be 
of the linear form (1.3); in particular, Ax^ = A^yAx'^. Therefore As'^ is a polynomial 
of second order in the components Ax^. But we have just seen that As^ and As'^ have 
identical roots; since polynomials having identical roots are necessarily proportional to 
each other, we know that there exists some number K such that 

As'2 = KAs^. (1.7) 

This number K depends on the matrix A appearing in Ax^ = A^yAx‘', which itself de¬ 
pends on the velocity v of the frame B with respect to the frame A. (This velocity is 
constant, since accelerated frames cannot be inertial.) But space is isotropic by assump¬ 
tion, so K actually depends only on the modulus ||F|| of n, and not on its direction. In 
particular, K{v) = K{—v). Since the velocity of A with respect to B is —n, we know 
that 

As^ = KAs'^ ^ = ^ AT^As^ 

implyiing that = 1. Since real transformations cannot change the signature of a 
quadratic form, K cannot be negative, so K = 1. ■ 
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1.2.3 Lorentz transformations 


We now know that coordinate transformations between inertial frames must preserve the 
square of the interval; let us work out the consequences of this statement for the matrix 
A in (1.3). To simplify notations, we will see and (a^) as four-component column 
vectors, so that (1.3) can be written as 

x' = A ■ X + a. (1.8) 

Similarly, seeing (Ax^) as a vector Ax, the square of the interval (1.4) becomes As^ = 
Ax* ■ r] ■ Ax. (The superscript “t” denotes transposition.) Then, since Ax' = A ■ Ax, 
demanding invariance of the square of the interval under ( 1 . 8 ) amounts to the equality 

As'^ = Ax* ■ (A*r 7 A) ■ Ax = Ax* ■ rj ■ Ax = As^, 

to be satished for any Ax. Because rj is non-degenerate, this implies that the matrix A 
satishes 

A* 77 A = rj. (1.9) 

Definition. The Lorentz group (in four dimensions) is 


0(3,1) = L = {A e M(4, M) \A'r]K = r/} , 


( 1 . 10 ) 


where M(4, M) denotes the set of real 4x4 matrices. More generally, the Lorentz group 
in d space-time dimensions, 0(d — 1,1), is the group of real d x d matrices A satisfying 
property (1.9) for the d-dimensional Minkowski metric matrix rj = diag(—1,1,1,..., 1). 

d—1 times 


Remark. This definition is equivalent to saying that the rows and columns of a Lorentz 
matrix form a Lorentz basis of M**, that is, a basis {cq, Ci, 62 ,..., e^-i} of d-vectors Cq, such 
that = pag. The Lorentz group in d space-time dimensions is a Lie group of real 

dimension d{d — l)/2. This is analogous to the orthogonal group 0(d), defined as the set 
of d X d matrices O satisfying O'^O = I, where I is the identity matrix. In particular, the 
rows and columns of an orthogonal matrix form an orthonormal basis of M**. 

Definition. The group consisting of inhomogeneous transformations (1.8), where A 
belongs to the Lorentz group, is called the Poincare group or the inhomogeneous Lorentz 
group. Its abstract structure is that of a semi-direct product 0(3,1) x where is 
the group of translations, the group operation being given by 

(A, a) ■ (A', a') = (A • A', a + A ■ a'). 

Of course, this definition is readily generalized to d-dimensional space-times upon replac¬ 
ing 0(3,1) by 0(d — 1,1) and by M**. We will revisit the definition of the Lorentz 
and Poincare groups at the end of subsection 3.1, with the tools of pseudo-Riemannian 
geometry. Apart from that, in the rest of these notes, we will mostly need only the homo¬ 
geneous Lorentz group (1.10) and we will not really use the Poincare group. We stress, 
though, that the latter is crucial for particle physics and quantum field theory [27,28] . 
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1.3 Subgroups of the Lorentz group 

The defining property (1.9) implies that each matrix A in the Lorentz gronp satisfies 
det(A) = ±1. This splits the Lorentz group in two disconnected subsets, corresponding 
to matrices with determinant +1 or —1. In particular, Lorentz matrices with determinant 
+1 span a subgroup of the Lorentz group 0(3,1) = L, called the proper Lorentz group 
and denoted S0(3,1) or L^. It is the set of Lorentz transformations that preserve the 
orientation of space. 

Another natural subgroup of L can be isolated using (1.9), though in a somewhat less 
obvious way. Namely, consider the 0 — 0 component of eq. (1.9), 

~ (^°o) + A*oA*o = Tjoo = ~1- 

This implies the property 

(A*^o) = 1 + A*oA*o > 1, (1-11) 

valid for any matrix A in the Lorentz group. The inequality is saturated only if A*o = 0 
for all i = 1,2,3. Since the inverse of relation (1.9) implies At^A* = r] for any Lorentz 
matrix A, we also find 

(A%)' = 1 + AU°*>1, (1.12) 

with equality iff A*^* = 0 for all i = 1,2, 3. Thus, in particular, |A°o| = 1 iff A°j = A*o = 0 
for all i, in which case the spatial components A*^ of A form a matrix in 0(3). Just as the 
determinant property det(A) = ±1, the inequality in (1.11) splits the Lorentz group in 
two disconnected components, corresponding to matrices with positive or negative A°o. 
Note that the product of two matrices A, A', with positive A'^o and A'^Qj is itself a matrix 
with positive 0 — 0 component: 

(A-A')°o = AV'% = A%A'% + AU'‘o 

> A%A'%-|A°,A''o| 

( 1 . 11 ),( 1 . 12 ) , - , - , „ , 

> - |aU'*o| > 0. (1.13) 

(In the very last inequality we applied the Cauchy-Schwarz lemma to the spatial vec¬ 
tors whose components are A°j and A'*o-) Therefore, the set of Lorentz matrices A with 
positive A'^o forms a subgroup of the Lorentz group, called the orthochronous Lorentz 
group and denoted 0(3,1)^ or . As the name indicates, elements of are Lorentz 
transformations that preserve the direction of the arrow of time. 

Given these subgroups, one defines the proper, orthochronous Lorentz group 

SO(3,l)^ = lI = L+nL\ 

which is of course a subgroup of L. In fact, we will see at the end of this subsection that 
this is the maximal connected subgroup of the Lorentz group. The rows and columns 
of Lorentz matrices belonging to form Lorentz bases with a future-directed time-like 
unit vector, and with positive orientation. The group of orientation-preserving rotations 
of space, SO(3), is a natural subgroup of L\_, consisting of matrices of the form 

^ , with R e SO(3). 
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(1.14) 




the time-reversal matrix 


Note that L+ can be generated by adding to 

/-I 0 0 
0 1 0 
0 0 1 
\ 0 0 0 

Similarly, can be obtained by adding to L\_ the parity matrix 

/I 0 0 

0-10 
0 0-1 
\0 0 0 

More generally, the whole Lorentz gronp L can be obtained by adding T and P to L\_. 
(Note that T and P do not commnte with all matrices in This shonld be contrasted 
with the case of 0(3) and SO(3), where the three-dimensional parity operator belongs to 
the center of 0(3).) 


0 \ 
0 
0 

-V 


(1.16) 


0 \ 

0 

0 

V 


(1.15) 


1.4 Boosts and rapidity 

We have jnst seen that any rotation, acting only on the space coordinates, is a Lorentz 
transformation. In the langnage of inertial frames, this is obvions: if the spatial axes of 
a frame B are rotated with respect to those of an inertial frame A (and provided the 
time coordinates in A and B coincide), then B is certainly an inertial frame. The same 
wonld be trne even in Galilean relativity [7] . In order to see effects specific to Einsteinian 
special relativity, we need to consider Lorentz transformations involving inertial frames 
in relative motion. 



Fignre 2: Two inertial frames A and B related by a rotation of their spatial axes (the 
third space direction is omitted). The clocks of A and B are synchronized. 
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1.4.1 Boosts 


Call A and B the inertial frames used by Alice and Bob, with respective coordinates 
(x^) and (x'^). Suppose Bob moves with respect to Alice in a straight line, at constant 
velocity v. Without loss of generality, we may assume that the origins of the frames A 
and B coincide. (If they don’t, just apply a suitable space-time translation to bring them 
together.) By rotating the spatial axes of A and B, we can also choose their coordinates 
to satisfy x'^ = and x^ = x'^. Then, the only coordinates of A and B that are related 
by a non-trivial transformation are and Finally, using parity and time- 

reversal if necessary, we may choose the same orientation for the spatial frames of A and 
B, and the same orientation for their time arrows. 



Figure 3: The frame B (in red) is boosted with respect to A (in black) with velocity v 
along the x^ axis. The coordinates and x^ coincide with x'^ and x'^. In principle, the 
clocks of A and B (not represented in this figure) need not tick at the same rate. 


Under these assumptions the relation between the coordinates of A and those of B 
takes the form 


x''^ = AV with (A^) 


g 0 0\ 
r s 0 0 
0 0 10 ’ 
\0 0 0 1 / 


where p, q, r and s are some real coefficients such that A belongs to — in particular, 
p > 0. Since B moves with respect to A at constant velocity v (along the x^ direction), 
the coordinate x'^ of B must vanish when x^ = vt = nx°/c. By virtue of linearity, we 
may write 

x'^ = 7(n) • ^x^ — -x°j , 

where 7 (n) is some n-dependent, positive coefficient (on account of the fact that the direc¬ 
tions x^ and x'^ coincide). Demanding that A satisfies relation (1.9), with the restrictions 
p > 0 and 7 (n) > 0, then yields 


/ 7('^) —j{v)-v/c 0 0\ 

— 7 (n)-n/c 7 (n) 0 0 

0 0 10 ’ 

\ 0 0 0 1 / 


and 7 (n) 


1 

a/1 — 


(1.17) 


A Lorentz transformations of this form is called a boost (with velocity v in the direction 
x^). In particular, reality of A requires |n| to be smaller than c: boosts faster than light 
are forbidden. 


Boosts give rise to the counterintuitive phenomena of time dilation and length con¬ 
traction. Let us briefly describe the former. Suppose Bob, moving at velocity v with 
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respect to Alice, carries a clock and measures a time interval At' in his reference frame; 
for definiteness, suppose he measures the time elapsed between two consecutive “ticks” 
of his clock, and let the clock be located at the origin of his reference frame. Call V the 
event “Bob’s clock ticks for the hrst time at his location at that moment”, and call Q 
the event “Bob’s clock ticks for the second time (at his location at that time)”. Then, in 
Alice’s coordinates, the time interval At separating the events V and Q is not equal to 
At'; rather, according to (1.17), one has At = 'j{v)At'. Since 'y{v) is always larger than 
one, this means that Alice measures a longer duration than Bob; Bob’s time is “dilated” 
compared to Alice’s time, and j(v) is precisely the dilation factor. This phenomenon is 
responsible, for instance, for the fact that cosmic muons falling into Earth’s atmosphere 
can be detected at the level of the oceans even though their time of flight (as measured 
by an observer standing still on Earth’s surface) is about a hundred times longer than 
their proper lifetime. In subsection 4.3, we will see that boosts also lead to surprising 
optical effects on the celestial sphere. 


1.4.2 Notion of rapidity 

Although the notion of velocity used above is the most intuitive one, it is not the most 
practical one from a mathematical viewpoint. In particular, composing two boosts with 
velocities v and w (in the same direction) does not yield a boost with velocity v + w. It 
would be convenient to hnd an alternative parameter to specify boosts, one that would 
be additive when two boosts are combined. This leads to the notion of rapidity [5,9], 

x{v) = argtanh(n/c), (1.18) 


in terms of which the boost matrix (1.17) becomes 


A 


/ cosh X 
— sinh X 
0 

V 0 


— sinh X 

coshx 


0 

0 


0 

0 

1 

0 


0 \ 

0 

0 

V 




(1.19) 


One verihes that the composition of two such boosts with rapidities Xi and X 2 is a boost 
of the same form, with rapidity Xi + X 2 - 


Rapidity is thus the additive parameter specifying Lorentz boosts. It exhibits the fact 
that boosts along a given axis form a non-compact, one-parameter subgroup of It also 
readily provides a formula for the addition of velocities: the composition of two boosts 
with velocities v and w is a boost with rapidity x(n) + equivalently, according to 

(1.18), the velocity V of the resulting boost is 

V -\- w 

V = c ■ tanh(x(n) -t- x('^)) = c ■ tanh (argtanh(n/c) -f argtanh(i/;/c)) =- 

1 + vw/c^ 

This is the usual formula for the addition of velocities (in the same direction) in special 
relativity [7]. 

As practical as rapidity is, its physical meaning is a bit obscure: the dehnition (1.18) 
does not seem related to any measurable quantity whatsoever. But in fact, there exist 
at least three different natural dehnitions of the notion of “speed”, and rapidity is one of 
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them [10]. To illustrate these dehnitions, consider an observer, Bob, who travels by train 
from Brussels to Paris [29], and measures his speed during the journey. For simplicity, 
we will assume that the motion takes place along the x axis of Alice, an inertial observer 
standing still on the ground. 

A first notion of speed he might want to dehne is an “extrinsic” one: he lets Alice 
measure the distance between Brussels and Paris, and the two clocks of the Brussels and 
Paris train stations are synchronized. Then, looking at the clocks upon departure and 
upon arrival, he dehnes his velocity as the ratio of the distance measured by Alice to 
the duration of his trip, measured by the clocks in Brussels and Paris. The inhnitesimal 
version of velocity is the usual expression v = dx/dt, where x and t are the space and 
time coordinates of an inertial frame which, in general, is not related to Bob. (In the 
present case, these are the coordinates that Alice, or any inertial observer standing still 
on the ground, would likely use.) 

A second natural dehnition of speed is given by proper velocity. To dehne this notion. 
Bob still lets Alice measure the distance between Brussels and Paris, but now he divides 
this distance by the duration that he himself has measured using his wristwatch. The 
inhnitesimal version of (the x component of) proper velocity is u = dx/dr, where r 
denotes Bob’s proper time, dehned by 

— ^dx"^ ( 1 - 20 ) 

along Bob’s trajectory. (In (1.20), it is understood that Bob’s trajectory is written as 
{t, x{t)) in the coordinates {t, x) of Alice, but the value of dr would be the same in any in¬ 
ertial frame with the same direction for the arrow of time, by virtue of Lorentz-invariance 
of the interval, eq. (1.6).) If Bob’s motion occurs at constant speed, the relation between 
the X component of proper velocity and standard velocity is u = j(v) ■ v, as follows from 
time dilation. 

Finally, Bob may decide not to believe Alice’s measurement of distance, and that he 
wants to measure everything by himself. Of course, sitting in the train, he cannot measure 
the distance between Brussels and Paris using a measuring tape. He therefore carries an 
accelerometer and measures his proper acceleration at each moment during the journey. 
Starting from rest in Brussels (at proper time r = 0 say), he can then integrate this 
acceleration from r = 0 to r = s to obtain a measure of his speed at proper time s. This 
notion of speed is precisely the rapidity introduced above [10], up to a conversion factor 
given by the speed of light. Indeed, assuming that Bob accelerates in the direction of 
positive x, his proper acceleration at proper time r is the Lorentz-invariant quantity [6,30] 



It is the value of acceleration that would be measured by a “locally inertial observer”, that 
is, an observer whose velocity coincides with Bob’s velocity at proper time r, but who is 
falling freely instead of following an accelerated trajectory. The integral of this quantity 
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along proper time, provided Bob accelerates in the direction of positive x, is thus 


I{s) 




1 / dx/dr 
\dt/dr 


1/2 




2 / d'^t/dr'^ 
\d?x/dT’^ 


( 1 . 21 ) 

( 1 . 22 ) 


where we used the definition ( 1 . 20 ) of proper time, which implies 


dx\^ 2 f dt\^ 2 dxd^x 2^'^ 

dr) dr dr'^ dr dr'^ 


^ since Bob’s proper velocity along x vanishes at proper time r = 0, 
the integral ( 1 . 22 ) can be written as 

/■«(«) _ /•“(») _ 

I{s) = / du\/l — v'^/c^ = / d ( 7 (n) ■ v) \/l — /(? 

Jo Jo 

-—= c • argtanh(n(s)/c) ^ c • x(^^(s)), (1-23) 

where u{s) and n(s) are Bob’s proper velocity and velocity at proper time s. This is 
precisely the relation we wanted to prove: up to the factor c, the integral ( 1 . 21 ) of proper 
acceleration coincides with rapidity. In subsection 4.3, we will also see that rapidity is 
the simplest parameter describing the effect of Lorentz boosts on the celestial sphere. 



1.5 Standard decomposition theorem 

We now derive the following important result, which essentially states that any proper, 
orthochronous Lorentz transformation can be written as a combination of rotations to¬ 
gether with a standard boost (1.17). We closely follow [5]. 

Theorem. Any A G can be written as a product 

A = RMx)R2, (1.24) 

where Ri and R 2 are rotations of the form (1.14) and L{x) is a Lorentz boost of the 
form (1.19). The decomposition (1.24) is called standard decomposition of a proper, 
orthochronous Lorentz transformation. There are many such decompositions for a given 

A. 

Proof. Let A G Let a denote the vector in whose components are the coefficients 
(A^o). If n = 0, then A^^ = 0 and A is of the form (1.14). In that case the decomposition 
(1.24) is trivially satisfied with Ri = L{x) = I and R 2 = A. If a 7 ^ 0 , let ei denote one 
of the two unit vectors proportional to a (ci = Aa, (ei)^ = 1); write its components as 
(q?!, 02 , as). Let also 62 and 63 be two vectors in such that the set { 61 , 62 , 63 } be an 
orthonormal basis of with positive orientation [i.e. the 3x3 matrix whose entries are 
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the components of Ci, 62 and 63 belongs to SO( 3 )); denote their respective components 
as (/?!,/32,/ds) and (71,72,73)- Then consider the rotation matrix 


Ri 

The product RiK takes the form 


/I 0 0 0 A 


/A% 

0 C^i 0^2 0^3 


A'o 

0 Pi P 2 Ps 


A% 

\0 7i 72 73 / 


VA^o 



/I 

0 

0 

0\ 


0 


0:2 

03 


0 

Pi 

P 2 

Ps 


VO 

7i 

72 

73 / 


A°i 

A°2 

AS\ 


(AS 

A°i 

AS 

AS\ 

A\ 

A'2 

AS 


X 

X 

X 

X 

A^i 

A\ 

AS 


0 

Ai 

A 2 

/^3 

A^i 

AS 

as) 


V 0 

I'l 

^2 

vz j 


(1.25) 


where the x’s are unimportant numbers and where (/ii,/i2,/is) and (iii,ii2,^3) are the 
components of two mutually orthogonal unit vectors in (because RiK G so the 
rows and columns of this matrix form a Lorentz basis); let us denote these unit vectors 
by /2 and /s, respectively. Let also /i be the (unique) vector such that {/i, /2, /s} be an 
orthonormal basis of with positive orientation. Dehne the rotation R2 by 


/I 0 0 0\ 

0 Ai /ii z/i 

0 A 2 /i2 ^2 
\0 A3 /i3 Z/3/ 


where (Ai,A2,A3) are the components of fi. Then, the product R1AR2 reads 


0 

0 

A^i 

AS 

> 

0 

CO 


(1 

0 

0 

o\ 


(AS 

X 

k 

l\ 

X 

X 

X 

X 


0 

Ai 

/Xl 



X 

X 

m 

n 

0 

/Xl 

fJ‘2 

h3 


0 

A2 

1^2 

1^2 


0 

0 

1 

0 

V 0 

1^1 

^2 

i^3 / 


VO 

A3 

As 

^s) 


V 0 

0 

0 

V 


Since R1AR2 belongs to the two last rows of ( 1 . 26 ) must be orthogonal to the two 
hrst ones, so the 2x2 matrix 

n A 

\m nJ 

must vanish. Thus R1AR2 is block-diagonal. 


R 1 AR 2 


B 0 
0 1 


5 


with B G SO(l, 1 ) 1 . This implies that 

„ f cosh Y — sinh v 

^ = • n n 

y—sinhy coshy 

for some y, so that R1AR2 is a pure boost of the form ( 1 . 19 ). In other words, writing 
Ri = (.Ri)“^ and R2 = (.^2)”^, the decomposition ( 1 . 24 ) is satished. ■ 
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Remark. The standard decomposition theorem holds in any space-time dimension d > 
3. The proof is a straightforward adaptation of the argument used in the four-dimensional 
case. (In d = 2 space-time dimensions, there are no spatial rotations and the proper, 
orthochronous Lorentz group SO(l, 1)^ only contains boosts in the spatial direction. In 
this sense, the standard decomposition theorem is trivially staisfied also in d = 2 .) 

1.6 Connected components of the Lorentz group 

In a general topological group G (and in particular in any Lie group), we call connected 
component oi g & G the set of all elements in G that can be reached by a continuous path 
starting g. In particular, we denote by Gf. the connected component of the identity 
e G G. A group is connected if it has only one connected component — that of the 
identity —, in which case G = G^- 

Proposition. G^ is a normal subgroup of G. Furthermore, the set of connected com¬ 
ponents of G coincides with the quotient group G/G^- 

Proof. We first prove that Gg is a subgroup of G. Let hi and ^2 belong to Gg, and let 

7 i : [0,1] —>■ G : 11 --)- 7i(^) and 72 : [0,1] —)■ G : 11 --)- 72 (^) 

be two continuous paths such that 7 i( 0 ) = 72 ( 0 ) = e and 7 i(l) = hi, 72 ( 1 ) = ^ 2 . Then 
the path 

7 i ■ 72“^ : [0,1] ^ G : f ^ 7i(^)72(^)“^ 

joins e to hihf^. Therefore hihf^ belongs to Gg, and the latter is a subgroup of G. 

Let us now show that Gg is a normal subgroup. Let h G Gg and let g & G. Consider 
the element ghg~^ in G. Since h belongs to the connected component of the identity, 
there exists a continuous path 7 : [0,1] —)■ G : f 1 —)■ 7(f) such that 7 ( 0 ) = e and 7 ( 1 ) = h. 
But then the map 

gjg-^ : [0,1] ^ G : f ^ g-f{t)g-^ 

is also a continuous path in G, joining 7 ( 0 ) = e to 7 ( 1 ) = ghg~^. Therefore ghg~^ G Gg. 
Since this is true for any h & Ge and any G G, Gg is a normal subgroup of G. 

We now turn to the second part of the proposition. Suppose first that gi and g 2 belong 
to the same connected components in G; let 7 : [0,1] —)■ G : f 1 —)■ 7(f) be a continuous 
path such that 7 ( 0 ) = gi and 7 ( 1 ) = g 2 . Then, gf^j^t) is a continuous path joining the 
identity to gf^g 2 , so gf^g 2 belongs to Gg. Therefore, the cosets giG^ and 5 ' 2 Gg, seen as 
elements of the quotient group G/G^, coincide. (Since Gg is a normal subgroup of G, 
G/Gg is indeed a group.) 

Conversely, suppose the cosets giGe and g 2 Ge coincide as elements of GjGe- Then 
there exists an h in Gg such that g 2 = 5 'ih, and a path 7 : [ 0 , 1 ] —)■ Gg such that 7 ( 0 ) = e 
and 7 ( 1 ) = h. But then the path gi’jit) joins gi to g 2 , so gi and g 2 belong to the same 
connected component of G. ■ 

Let us now apply this proposition to the Lorentz group. First observe that is 
connected, as follows from the standard decomposition theorem; in (1.24), each factor 
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can be linked to the identity by a continnons path, and this remains trne for the prodnct 
of these factors^. Thus, L\ is the connected subgroup of the Lorentz group, and the 
connected components of the latter coincide with the quotient L/L\. As noted below 
expression (1.16), each element of the Lorentz group L can be reached by adding parity 
and/or time-reversal to the connected Lorentz group L\. Thus, the quotient L/L\ is the 
group Z 2 X Z 2 generated by P and T and the Lorentz group has exactly four connected 
components, denoted as follows: 


4: 

det(A) = 1 

and 

A% > 1, 

Li-- 

det(A) = 1 

and 

A% < 1, 

Li: 

det(A) = — 1 

and 

A°o > 1, 

Li: 

det(A) = — 1 

and 

A% < 1. 


As already mentioned, L+ = L\_ U L\. and = L\_ U are subgroups of the Lorentz 
group. Note that L\ U hi is also a group. 

Remark. Analogous results hold for the Lorentz group 0{d — 1,1) in any space-time 
dimension d. The properties | det(A)| = 1 and jA'^ol > 1 remain true and the dehnition of 
the proper Lorentz group SO{d — 1,1) and the orthochronous Lorentz group 0{d — 1,1)^ 
are straighforward generalizations of L+ and L^. Similarly, one dehnes SO{d — 1,1)^ = 
SO{d — 1,1) n 0{d — 1,1)^. The standard decomposition theorem (1.24) remains true, 
provided SO(3) is replaced by SO(d — 1). In particular, SO(d — 1,1)^ is the connected 
subgroup of the Lorentz group and the latter splits in four connected components. The 
transition between different components is realised by the generalization of the time- 
reversal and parity matrices (1.15) and (1.16). (In odd space-time dimensions, parity 
is not just diag(l, —1,..., —1), since that matrix belongs to the proper Lorentz group. 
Rather, in odd dimensions, parity is P = diag(l, 1,—1,...,—1).) 

2 Lorentz groups and special linear groups 

Having dehned the Lorentz group, we now turn to its realization as the group SL(2,C) 
of volume-preserving linear transformations of C^. Since the method used to derive this 
isomorphism has a wide range of applications, we will use it repeatedly in this section, 
proving three different isomorphisms along the way: hrst, in subsection 2.1, we relate 
SO(3) to SU(2). Then, in subsection 2.2, we turn to the isomorphism between SL(2,M) 
and the Lorentz group in three space-time dimensions. Finally, in subsection 2.3, we 
establish the announced link between L)/ and SL(2,C)^. We end in subsection 2.4 by 
mentioning (without proving them) higher-dimensional generalizations of these results 
and their relation to division algebras. 

Before dealing with specihc constructions, let us review a general group-theoretic 
result. Let G and H be groups, f : G ^ H a homomorphism. Then, the kernel Ker(/) of 
/ is a normal subgroup of G and the quotient of G by Ker(/) is isomorphic (as a group) 

^This argument relies in particular on the fact that the group SO (3) of orientation-preserving rotations 
is connected. 

^These results have important implications for representation theory; we shall not discuss those 
implications here and refer to [5,28,31] for more details. 


17 




to the image of /: 


( 2 . 1 ) 


G/Ker(/) = Im(/). 

The proof is elementary, as it suffices to observe that the map 

G/Ker(/) -> Im(/) : ^Ker(/) ^ f{g) 

is a bijective homomorphism, that is, the sought-for isomorphism. All isomorphisms 
exposed in this section will be obtained using that method; we will construct well-chosen 
homomorphisms that will lead us to the desired isomorphisms through relation ( 2 . 1 ). 


2.1 A compact analogue 

Here we establish the isomorphism between SO(3) and the quotient of SU(2) by its center, 
following [5]. This relation is important for our purposes both because of the simplicity 
of the example, and because of its role in the isomorphism between the Lorentz group in 
four dimensions and SL(2, C). We begin by reviewing briefly the main properties of the 
unitary group in two dimensions. 


2.1.1 Properties of SU(2) 

The unitary group U(2) in two dimensions is the group of linear transformations of 
that preserve the norm ||( 2 ;,ta)|p = \z\ -f It consists of 2 x 2 complex matrices U 
that are unitary in the sense that 

f/^f/ = I 2 = 2 X 2 unit matrix, (2-2) 

where f denotes hermitian conjugation {U^ = (U^)*). It follows that the lines and 
columns of each matrix U G U(2) dehne an orthonormal basis of for the scalar product 
(z,w) ■ (z',w') = z*z' w*w'. By virtue of the dehning property (2.2), each U G U(2) 
has I det(f/)| = 1. In particular, we dehne the special unitary group SU(2) in two dimen¬ 
sions as the subgroup of U(2) consisting of matrices U with unit determinant, det(17) = 1. 

For later purposes, we will need to know some topological properties of SU(2). De¬ 
manding that the matrix 



belong to SU(2) imposes the conditions \a\^ + |/3p = I 7 P -|- = 1, ay* -|- 135* = 

a(3* + 7 ( 5 * = 0 and a(5 — /^y = 1. These requirements are solved hy 5 = a* and 7 = —(3*, 
so each matrix in SU(2) can be written as 


U 


1 a 

'’1 

with a 

U/5* 

a* J ' 



1 . 


Thus, each element of SU(2) is uniquely determined by four real numbers ai, a 2 ,/5i ,/52 
such that + 0(2 3- Pf + (32 = 1- These numbers dehne a point on the unit 3-sphere^. 

"'^Recall that the n-sphere S'" is defined as the set of points in that are located at unit distance 
from the origin. 


18 



Furthermore, this description is not redundant (two different quadruples lead to two dif¬ 
ferent elements of SU(2)), so SU(2) is homeomorphic^ to S^, as a topological space. In 
particular, SU(2) is connected and simply connected. 


Finally, recall that the center of a group G is the set of elements that commute with 
all elements of G. In particular, it is an Abelian normal subgroup of G. It is easy to show 
that the center of SU(2) consists of the two matrices 


l2 = 



-I2 



(2.3) 


and is thus isomorphic to Z 2 . This observation will be important in the next paragraph, 
and we will use it again once we turn to the Lorentz group in four dimensions. 


2.1.2 The isomorphism 

Theorem. One has the following isomorphism: 

SO(3) ^ SU(2)/Z2, (2.4) 

where Z 2 is the center of SU(2). In other words, SU(2) is the double cover of SO(3), and 
it is also its universal cover. 


Proof. Consider the space V of 2 x 2 traceless Hermitian matrices. Each matrix X G V 
can be written as X = xVj (with implicit summation over i = 1, 2, 3), where the x*’s are 
real coefficients, while the cxj’s are Pauli matrices 


CTl = 





(2.5) 


The space V is obviously a three-dimensional real vector space. Note that det(X) = 
—= — ||a;|p, where ||.|| denotes the Euclidean norm in In addition, as a vector 
space, V is isomorphic to the Lie algebra su(2) of SU(2). The group SU(2) naturally acts 
on its Lie algebra by the adjoint action, for which U G SU(2) maps X G V on f/Xf/1 G V. 
This action is a representation of SU(2), that is, a homomorphism from SU(2) into the 
linear group of Furthermore, it preserves the norm in V = in the sense that 

det{UXU^) = det(X) = -||x||2 Vf/ G SU(2), VX G H. 

Thus, the adjoint action of SU(2) on consists of orthogonal transformations and we 
can define a homomorphism 


/:SU(2)^0(3):17^/[t/], (2.6) 

where the 3x3 matrix f[U] is given by the condition 

Ux^aiU^ = fpf.x^ai Vx G that is, UajU^ = f[Uy.ai Vj = 1,2,3. (2.7) 

It remains to compute the image and the kernel of the map / so defined. 


^By definition, two topological spaces are homeomorphic if there exists a continuous bijection, mapping 
the first space on the second one, whose inverse is also continuous. 
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We begin with the image. By (2.7), the entries of the matrix f[U] are quadratic 
combinations of the entries of U, so / is continuous. Since SU(2) is connected, the image 
of / must be contained in the connected subgroup of 0(3), that is, SO(3). To prove the 
isomorphism (2.4), we need to show that the opposite inclusion holds as well, i.e. that 
any matrix in SO(3) can be written as f[U] for some U G SU(2). 

The latter statement actually follows from a geometric observation [5]: any rotation 
R{n, if) of (around an axis n, by an angle ip) can be written as the product R{n, p) = 
ST of two reflexions S and T with respect to planes whose intersection is the rotation 
axis, the angle between the planes being half the angle of rotation. Explicitly, S and T 
can be written as 


S ■. X ^ X — 2{x ■ m)m and T ■. x ^ x — 2{x ■ q)q, (2.8) 

where m and q are unit vectors orthogonal to the planes corresponding to the reflexions S 
and T, respectively. Then, dehne the matrices M = mVj and Q = q^Ci. These matrices 
are Hermitian, traceless, have determinant —1 and square to unity. Dehning similarly 
X = x^ai, the reflexions (2.8) can be written as 

S :X ^ -MXM and T : X ^ -QXQ. 

Therefore, the rotation R{n, p) = ST (with T acting hrst) acts on X according to 

R{n, p):X^ MQXQM. (2.9) 

But now note that the product U = MQ is such that W = QM with f/f/1 = MQQM = 
I 2 , and it has unit determinant. Hence U = MQ belongs to SU(2) and the transformation 
(2.9) is of the form (2.7) dehning the homomorphism /. Hence we can write the rotation 
R{n,p) as f[U]. This proves that / is surjective on S0(3). 

To conclude the proof of (2.4) we turn to the kernel of /, that is, the inverse image of 
the unit element in SO(3). Saying that f[U] is the identity in SO(3) is just saying that 
UXU^ = X for any X in V, which in turn is equivalent to saying that U commutes with 
all X’s. But, when this holds, U also commutes with any e*^. Since X is Hermitian and 
traceless, this is the same as saying that U commutes with all elements of SU(2), i.e. that 
U belongs to the center of SU(2). The latter consists of the two matrices (2.3), which 
form a group Z2. I 

Remark. From the dehnition (2.7) of the homomorphism / and the form of the Pauli 
matrices, one can easily read off the explicit expression of the orthogonal matrix f[U], 
for U an element of SU(2): 

( Re{ad + bc) lm{ad — be) Re{dc — bd) \ 

lm{dd + bc) Re{ad — bd) lm{ac — bd) j . (2.10) 

Relab — cd) lm(ab — cd) |(|ap — — |cp + |dp)y 

This formula exhibits the fact that / is insensitive to an overall change of sign in the 
entries of its argument, since the right-hand side only involves quadratic combinations 
of those entries. In particular, it implies that the kernel of / must contain the matrices 

(2.3), 
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2.2 The Lorentz group in three dimensions 

The Lorentz group in three space-time dimensions is the group 0(2,1), as dehned in 
subsection 1.2. Its connected subgroup is S0(2,1)"*^. We will show here that this connected 
group is isomorphic to the quotient of SL(2, M) by its center. Before doing that, we review 
a few topological properties of SL(2, M). For the record, the results of this subsection will 
play a minor role in the remainder of these notes, so they may be skipped in a hrst 
reading. 


2.2.1 Properties of SL(2, M) 


The group SL(2, M) is the group of volume-preserving linear transformations of the plane 
It can be seen as the group of real 2x2 matrices with unit determinant; 


SL(2,M) 




M(2,M) 


ad — be 



Lemma. The group SL(2,M) is connected, but not simply connected. It is homotopic 
to a circle; in particular, the fundamental group of SL(2,]R) is isomorphic to Z. 


Proof. Let 

S=(“ fjeSU2,Ki. 

Since det(S') 7 ^ 0, the vectors (a, 6 ) and (c, d) in are linearly independent. We can 
therefore hud three real numbers a, fd and 7 such that the set 

{d ■ (a,6),/3 ■ (a, 6 )-t -7 ■ (c, d)} ( 2 . 11 ) 

be an orthonormal basis of Equivalently, there exists a matrix 


K = 



such that the product 


0 = KS = 


aa ab 

13a + ^c ^b + j3d 


be an orthogonal matrix (since the lines and columns of an orthogonal matrix form an 
orthonormal basis). We can choose -|- 6 ^, making a positive. Since det(.^) = 

dy = det((!l) = ±1, we may choose the orientation of the basis (2.11) so that 7 = 1/d, 
i.e. O G SO(2). Thus, any matrix S G SL(2,]R) can be written as 


5 = R-^O = KO, with O G SO(2) and K 


(k 0 \ 
\m l/kj 


for some m G M and A; G M strictly positive. Now, the set of triangular matrices of the 
form 


r M 

\m 1/kJ 


with fc > 0 and m G M 


is homeomorphic to M x M’*', which is connected and has the homotopy type of a point. 
On the other hand, SO(2) is homeomorphic to a circle. This shows that SL(2,]R) is 
connected and homotopic to a circle. In particular, the fundamental group of SL(2,M) is 


Z. 
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Note that the center of SL(2, M) is the same as that of SU(2) (see eq. (2.3)): it consists 
of the identity matrix and minus the identity matrix, forming a group isomorphic to Z 2 . 


2.2.2 The isomorphism 
Theorem. There is an isomorphism 

SO( 2 ,l)^ = SL( 2 ,M)/Z 2 , (2.12) 

where Z 2 is the center of SL(2,M). In other words, SL(2,M) is the double cover of the 
connected Lorentz group in three dimensions (but it is not its universal cover, since it is 
not simply connected). 

Proof. Our goal is to build a well chosen homomorphism mapping SL(2, M) on SO(2,1)^. 
Consider, therefore, the space V of real, traceless 2x2 matrices, that is, the Lie algebra 
of SL(2,M). Each matrix X in V can be written as 


X = (implicit sum over /i = 0,1, 2), 

where the x^’s are real numbers, while the t^’s are the following matrices: 


to = 




ti = 




(2.13) 


(These matrices are generators of the Lie algebra of SL(2,M).) Note that, with this 
convention, the determinant of X is, up to a sign, the square of the Minkowskian norm 
of the corresponding vector {x^)\ 

det(X) = —rj^yX^x’^ = —x^. (2-14) 

There is a natural action of SL(2,M) on the space V. Namely, with each S G SL(2,M), 
associate the map 

\ : X ^ SXS-\ (2.15) 

This is the adjoint action of SL(2,M). It is linear and it preserves the determinant, since 
det(S'XS'“^) = det(X). In addition, thanks to (2.14), each map (2.15) can be seen as a 
Lorentz transformation acting on the 3-vector [x^). We can thus dehne a map 

/:SL(2,M)^0(2,l):5^/[^], 

where the 3x3 matrix f[S] is given by 

St^x^S-^ = tjisr^x’' V {x^ e 

or equivalently, 

St^S-^ = tJ[Sr^ V/r = 0,l,2. (2.16) 

Because {ST)X{ST)~^ = S{TXT~^)S~^ for all matrics S', T in SL(2,M), the map / 
is obviously a homomorphism. Furthermore, by (2.16), the entries of f[S] are quadratic 
combinations of the entries of S'; therefore / is continuous. In particular, since SL(2, M) is 
connected, the image of / is certainly contained in the connected Lorentz group SO(2,1)"^. 
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It remains to prove that / is surjective on SO(2,1)"^ and to compute its kernel. We 
begin with the former. Let therefore A G SO(2,1)^. The standard decomposition theorem 
(1.24) adapted to d = 3 states that A can be written as A = RiL{x)R 2 , where Ri and R 2 
belong to the SO(2) subgroup of 0(2,1), while L{x) is a standard boost of the form (1.19) 
with the last line and last column suppressed. To prove surjectivity of / on S0(2,1)1, we 
need to show that there exist matrices Si, S2 and S{x) in SL(2,M) such that 

f[Si] = Ri, f[S2]=R2 and f[S{x)] = Lix). (2.17) 


We begin with the rotations. If S belongs to the S0(2) subgroup of SL{2,'R), i.e. 


sin9\ 
smt^ cos UJ 

for some angle 9, then formula (2.16) gives 

/I 0 0 \ 

f[S] = ( 0 cos26* —sin26* . (2.19) 

yo sin 26* cos 26* J 

This implies that any rotation R in SO(2,1)1 can be realised as R = /[S'] for some matrix 
S of the form (2.18) in SL(2,]R). Thus, to prove surjectivity of / as in (2.17), it only 
remains to find a matrix S{x) such that f[S{x)] = L{x) be a standard boost with rapidity 
y in three space-time dimensions. Again, using (2.16), one verifies that the matrix 


^(a) 


0 \ 
V 0 


is precisely such that 


f[S{x)] 



— sinh y 
cosh y 


0 



( 2 . 20 ) 


which was the desired relation. We conclude that / is surjective on SO(2,1)1, as expected. 


Finally, to establish (2.12), we need to show that the kernel of / is isomorphic to Z 2 . 
The proof is essentially the same as for SU(2). Indeed, saying that S G SL(2,M) belongs 
to the kernel of / means that S commutes with any linear combination of the generators 
(2.13). But this implies that S commutes with all elements of SL(2, M), i.e. that S belongs 
to the center of SL(2,M), which is just Z 2 . I 


Remark. Using (2.16), one can write down explicitly the homomorphism / as 

l(a^ -|- 6^ -|- -f dS) l(a^ — + — dS) —ab — cd\ 

l(a^ -|- 6^ — — d?) l(a^ — + d?) —ab -|- cd j , 

—ac — bd bd — ac ad + be J 

where the argument of / on the left-hand side is a matrix in SL(2,M). We already 
displayed two special cases of this relation in equations (2.19) and (2.20). As in the 
analogous homomorphism (2.10) for SU(2), the fact that the right-hand side only involves 
quadratic combinations of the entries of the SL(2,]R) matrix implies that / is insensitive 
to overall signs, so that its kernel necessarily contains Z 2 . 
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2.3 The Lorentz group in four dimensions 

We now turn to the analogue of the previous isomorphism for the Lorentz group in 
four dimensions, following [5] once again. As usual, we will begin by reviewing certain 
topological properties of SL(2,C), turning to the isomorphism later. 

2.3.1 Properties of SL(2, C) 

The group SL(2,C) is the set of volume-preserving linear transformations of the vector 
space C^. It can be seen as the group of complex 2x2 matrices with unit determinant: 

SL(2,C) = |(“ €M(2,C) 



Lemma. The group SL(2,C) is connected and simply connected. 

Proof. We use essentially the same technique as for the group SL(2, M) in subsection 2.2. 
Let 

S=(“ ;;)€SL(2,C). 

Then det(S') = 1 7 ^ 0, so the vectors (a, 6 ) and (c, d) in are linearly independent. We 
can then hnd three complex numbers d, ^ and 7 such that 

{d • (a, 6 ),;d • (a, 6)-^7 • (c, d)} ( 2 . 21 ) 

be an orthonormal basis of C^. In other words, there exists a matrix 


K = 



such that the product 


U = KS 


f aa ah \ 

\j3a + f]b + l3dj 


be unitary (since the lines and columns of a unitary matrix form an orthonormal basis). 
We may take d~^ = '^/|a) 2 ~+~|^, so that d is real and strictly positive. Since det(.A) = 
0:7 = det([/) is a complex number with unit modulus, and since the second basis vector 
in the orthonormal basis ( 2 . 21 ) is determined up to a phase, we may choose det(A') = 1 , 
that is, 7 = 1/d. Thus U G SU(2) and 7 is also a strictly positive real number. We 
conclude that any matrix in SL(2, C) can be written as 


S = R-^U = KU, with U E SU(2) and K 


(k 0\ 
\rn 1/kJ 


for some m E C and k E M. strictly positive. SU(2) is diffeomorphic to S^, so it is 
connected and simply connected. Furthermore, the group of triangular matrices of the 
form 


r M 

\rn 1/kJ 


with k > 0 and m E C 


is homeomorphic to C x M’*', which is also connected and simply connected. As a conse¬ 
quence, SL(2,C) itself is connected and simply connected. ■ 
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Note the topological difference between SL(2,]R) and SL(2,C); both are connected, 
but only SL(2,C) is simply connected, while SL(2,M) is homotopic to a circle. This 
subtlety has important consequences for (projective) unitary representations of SL(2,M) 
and SL(2, C), and, accordingly, for those of the Lorentz groups in three and four dimen¬ 
sions [27,28,32,33]. 

One can also verify that the center of SL(2, C) consists of the two matrices (2.3) and 
is thus isomorphic to Z 2 , exactly as in the case of SU(2) and SL(2,M). 

2.3.2 The isomorphism 
Theorem. There exists an isomorphism 


4 ^ SL( 2 ,C)/Z 2 , 


( 2 . 22 ) 


where Z 2 is the center of SL(2,C). In other words, SL(2,C) is the double cover of the 
connected Lorentz group in four dimensions, and it is also its universal cover. 

Proof. Proceeding as for the Lorentz group in three dimensions, we wish to build a 
homomorphism / : SL(2,C) —)■ 0(3,1) and compute its image and its kernel. Consider, 
therefore, the vector space V of 2 x 2 Hermitian matrices. It is a real, four-dimensional 
vector space: any matrix X G V can be written as 


f —x^ — ix^\ 

J ' 


(2.23) 


where the x^’s are real coefficients. This can also be written as X = x^t^^ where tq 
denotes the 2 x 2 identity matrix, while® ti = —cxi, T 2 = <J 2 and = <73 in terms of the 
Pauli matrices (2.5). Then, just as in (2.14), 

det(X) = det(x'^r^) = (x®)^ — (x^)^ — (x^)^ — (x^)^ = —rj^yX^x'' = —x^. (2.24) 

Let us now define an action of SL(2, C) on V; for each matrix S G SL(2, C), we consider 
the linear map 

V ^ V : X ^ SXS^. (2.25) 

This action preserves the determinant because det(S'XS'l) = det(X). By (2.24), this 
amounts to preserving the square of the Minkoswkian norm of the four-vector (x^), so 
the transformation (2.25) can be seen as a Lorentz transformation acting on (x^). We 
thus define a map 


/;SL(2,C)^ 0(3,1) :5^/[^], 

(2.26) 

where the 4x4 matrix f[S] is given by 


Sr.x^S^ = T^nsr.x’^ V (xfy G 

(2.27) 

or equivalently. 


ST„St = V,j = 0,l,2,3. 

(2.28) 


®The choice of signs is slightly unconventional here; it will eventually ensure that the action (4.8) of 
Lorentz transformations on celestial spheres coincides with the standard expression (3.23) of conformal 
transformations. 
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Since {ST)X{STy = S{TXT'^)S\ the map / is obviously a homomorphism. Further¬ 
more, by (2.28), the entries of f[S] are quadratic combinations of the entries of S'; so / 
is continuous. In particular, since SL(2,C) is connected, the image of / is contained in 
the connected Lorentz group SO(3,1)^ = L^. 

It remains to prove that / is surjective on and to compute its kernel. We have just 
seen that Im(/) C by continuity, so as far as surjectivity is concerned, we need 
only prove the opposite inclusion. Let therefore A belong to L\_. By the standard 
decomposition theorem (1.24), we can write A as a standard boost L{x), for some value of 
the rapidity y, sandwiched between two (orientation-preserving) spatial rotations: A = 
RiL{x)R 2 - Thus, in order to prove surjectivity of / on it suffices to hnd three 
matrices Si, S 2 and S{x) SL(2,C) such that 

f[Si] = Ri, f[S 2 ] = R 2 , f[S{x)] = L{x) (2.29) 

since in that case /[S'iS'(x)S' 2 ] = RiL{x)R 2 = A. Now, the restriction of the homomor¬ 
phism (2.26) to the SU(2) subgroup of SL(2, C) is precisely the homomorphism (2.6) that 
we used to prove the isomorphism SO(3) = SU(2)/Z2. We know, therefore, that there 
exist matrices Si and S 2 in SU(2) such that conditions (2.29) hold. As for the matrix 
S'(x), we make the educated guess 

= (”nh(x/2l rsh((/2)) = - sinh(x/2)T,. 

The image of S{x) under / can be read off from the property 

S{x)x^TfiS{x)^ = {x^ coshy — sinh y) Tq -|- (—sinhy -|- coshy) Ti -f- x‘^T 2 + 

Comparing with the dehnition (2.27) of /, we see that f[S{x)] is precisely the standard 
Lorentz boost L(y), as written in (1.19). In conclusion, the homomorphism / is surjective 

'f 

on L^. 

The last missing piece of the proof is the computation of the kernel of /. By dehnition, 
the kernel consists of matrices S such that SXS"^ = X for any X G V. Taking X = I 2 , we 
see that S must belong to SU(2). Then, taking X = ai, we observe that S must belong 
to the center of SU(2), that is, Z 2 . ■ 


Remark. As usual, the dehnition (2.28) can be used to compute explicitly the matrix 
f[S], when S belongs to SL(2,C). The result is 


/ 




(2.30) 


/|(|a|2 + |6|2 + |cf + |d|2) 

—Re {dc -|- bd] 

Im |dc -|- 

+ \b\^ - \c\^ - m 


—Re [ab + cd} 
Re {dd -|- dc} 
—Im [ad + be] 
—Re \ab — cd\ 


Im |ad -|- cd} 
—Im {ad — be] 
Re {ad — be 
Im I ad — cd 


l(|ap-|6|2+|42_|i|2)\ 

—Re {dc — dd} 

Im {dc — bd] 

i(|ap-|6p-|cp + |d|V 


involving only quadratic combinations of the entries of the argument of /, which exhibits 
the fact that the kernel of / must contain Z 2 . 


26 






2.3.3 Examples 

For future reference, let us display two specific families of matrices in SL(2, C) correspond¬ 
ing to rotations around the axis and boosts along that axis, associated respectively 
with the Lorentz matrices 

0 0 — sinh 
1 0 0 
0 1 0 
0 0 coshx / 

Demanding that these matrices be of the form f[S] for some S G SL(2,C) determines S 
uniquely, up to a sign, through formula (2.30). One thus hnds that rotations by 6 around 
are represented by 

S,„1 = ±( „ (2.31) 

while boosts with rapidity y along x^ are given by 

Sb.„.. = ±( (, (2.32) 

We will put these formulas to use in subsection 4.3, when describing the effect of Lorentz 
transformations on celestial spheres. 

2.4 Lorentz groups and division algebras 

In the two previous subsections, we proved the two very similar isomorphisms 

SO(2,l)^ = SL(2,M)/Z2 and SO(3,1)^ ^ SL(2, C)/Z 2 . (2.33) 

From this viewpoint, going from three to four space-time dimensions amounts to changing 
M into C. Now, from Hurwitz’s theorem it is well known that M and C are only the two 
hrst entries of a list of four normed division algebras (see e.g. [34]): the two remaining 
algebras are the set El of quaternions and the set O of octonions. Given this classihcation 
and the apparent coincidence (2.33), it is tempting to ask whether similar isomorphisms 
hold between certain higher-dimensional Lorentz groups and special linear groups of the 
form SL(2,EI) or SL(2,0). This turns out to be the case indeed: one can prove that the 
connected Lorentz groups in six and ten space-time dimensions satisfy [35-39] 

SO(5,l)^ ^ SL(2,e)/Z2 and SO(9,1)^ ^ SL(2,0)/Z2. 

We will not prove these isomorphisms here. We will not even attempt to explain the 
meaning of the last isomorphism in this list, given that octonions are not associative, 
so that what we call “SL(2, O)” is not obvious. Let us simply mention, as a curiosity, 
that these isomorphisms are related to the fact that minimal supersymmetric gauge held 
theories (with minimally coupled massless spinors) can only be dehned in space-time 
dimensions 3, 4, 6 and 10. More generally, the relation between spinors and division 
algebras spreads all the way up to superstring theory. We will not study these questions 
here and refer for instance to [38,39] for many more details. 


/I 0 0 0\ 

0 cos 6 — sin 6 0 

0 — sin 9 cos 9 0 

\0 0 0 1 / 


and 


/ cosh y 
0 
0 

y— sinh y 
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3 Conformal transformations of the sphere 

This section is a differential-geometric interlude: setting the Lorentz group aside, we will 
show that the quotient SL( 2 ,C)/Z 2 may be seen as the group of conformal transforma¬ 
tions of the 2-sphere. Accordingly, our battle plan will be the following. We will first 
define, in general terms, the notion of conformal transformations of a manifold (subsec¬ 
tion 3.1). We will then apply this definition to the plane and the sphere (subsections 
3.2 and 3.3) and classify the corresponding conformal transformations. These matters 
should be familiar to readers acquainted with conformal field theories in two dimensions, 
which we briefly mention in subsection 3.4. Although some basic knowledge of differential 
geometry may be useful at this point, it is not mandatory for understanding the text, 
as our presentation will not be cast in a mathematically rigorous language. We refer for 
instance to [40,41] for an introduction to differential geometry. 

3.1 Notion of conformal transformations 

In short, a conformal transformation of some space is a transformation which preserves 
the angles. To define precisely what we mean by “angles” (and hence conformal trans¬ 
formations), we will now review at lightspeed the notions of manifolds and Riemannian 
metrics. 

3.1.1 Manifolds and metrics 

Roughly speaking, a (smooth) manifold is a topological space that looks locally like a 
Euclidean space M”, the number n being called the dimension of the manifold. Here, by 
“locally”, we mean “upon zooming in on the manifold”; any point on the manifold admits 
a neighbourhood that is homeomorphic to Two typical examples of n-dimensional 
manifolds are itself, and the sphere S"'. Thanks to the locally Euclidean structure, we 
can define, at each point p of a manifold A4, a vector space consisting of vectors tangent 
to Ai at p; this vector space is called the tangent space of A4 at p, denoted TpAi. If we 
think of a manifold Ad as a smooth set of points embedded in some higher-dimensional 
Euclidean space then the tangent space TpAi is literally the (affine) hyperplane in 
that is tangent to Ai at p, endowed with the vector space structure inherited from 
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Figure 4; A manifold M embedded in The point p belongs to the manifold, and 
the plane tangent to M at p is the tangent space TpAi. In this drawing, we take N = 3 
and the manifold is two-dimensional. The grid was added to emphasize the fact that the 
manifold looks, locally, like a plane 


Given a vector space, it is natural to endow it with a scalar product, allowing one to 
compute norms of vectors and angles between vectors. Since a manifold Ad has a tangent 
space at each point, one would like to define a scalar product in the tangent space at 
each point of Ad; a metric does precisely this job. 

Definition. A (Riemannian) metric 5 ^ on Ad is the data of a scalar product in each 
tangent space of Ad, such that this scalar product varies smoothly on Ad [42]. More 
precisely, a metric is a symmetric, positive-definite, smooth tensor field 

g M ^ : p ^ Pp, (3.1) 

where Pp is the aforementioned scalar product in TpAd; 

Qp : TpM. X TpAd —)• M : (u, tn) H->■ gp{v, w). (3.2) 

The requirements of symmetry and positive-definiteness ensure that gp satisfies all the 
standard properties of a scalar product. This definition can be extended to pseudo- 
Riemannian metrics, that is, symmetric tensor fields such as (3.1) that are not necessarily 
positive-definite. In particular, we will see below that d-dimensional Minkowski space- 
time is the manifold endowed with the pseudo-Riemannian metric (3.9). 

3.1.2 Examples 

To illustrate concretely the above definition, let us consider a few simple examples of 
metrics on the manifold Ad = We can endow this manifold with global (Cartesian) 
coordinates {x, y) such that any point p G is identified with its pair of coordinates. 
Our first example is the Euclidean metric, whose expression in Cartesian coordinates is 

g = dx"^ -|- dy"^. (3.3) 
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To explain the meaning of this notation, let us pick a point (x, y) in and two vectors 
V and w at that point, with respective components {vx,Vy) and {wx,Wy). Their scalar 
product is given by (3.2), i.e. 

g(x,y){v, w) = {dx^ + dy"^) [(n„, Vy), (w^,, . (3.4) 

By dehnition, upon acting on a vector, dx gives the x-component of this vector. (In the 
standard language of differential geometry, dx is the differential of x, that is, the one-form 
dual to the vector held d/dx associated with the coordinate x on M^.) The notation dx^ 
is then understood as the operation which, upon acting on two vectors, gives the product 
of their components along x. A similar dehnition holds for dy and dy"^, except that they, 
of course, give ^/-components of vectors. Applying these rules to (3.4), we hnd that the 
metric (3.3) dehnes the standard Euclidean scalar product, 

g(x,y) {v, w) = VxWx + VyWy. (3.5) 

Of course, one can dehne more generally the Euclidean metric on to be 5 ^ = (dxi)^ -|- 
■ ■ ■ {dxdY ill terms of Cartesian coordinates. 



Figure 5; The plane and the Cartesian coordinates used to label its points. A point 
with coordinates (x,//) is also represented, and n, w are two vectors at that point. Their 
scalar product with respect to the Euclidean metric (3.3) is given by (3.5). 


A slightly less trivial example of metric on is given by 

dx^ -I- d//^ 

= (l + a;2 +2/2)2’ 

where (x, y) is the point at which the metric is evaluated. If then v and w are two vectors 
at (x, 2 /), with the same components as before, their scalar product with respect to this 
new metric is 

where we have used once more the rule saying that dx (resp. dy), upon acting on a vector, 
gives the x-component (resp. //-component) of the vector. By contrast to the Euclidean 
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scalar product (3.5), this expression depends explicitly on the point (x, y). In other words, 
if we take two families of vectors on with constant components {vx,Vy) and {wx,Wy) 
at each point of the plane, their scalar product will vary as we move on 

Of course, the metric (3.6) that we picked was chosen for illustrative purposes only: 
any positive function on multiplying dx"^ + would give a (generally position- 
dependent) Riemannian metric on More generally, any position-dependent, real 
quadratic combination of dx’s and di/’s, 

A{x, y)dx^ + 2B{x, y)dxdy + 0(x, y)dy‘^ , (3.7) 

is a Riemannian metric on as long as A and AC — are everywhere positive. If 
V and w are two vectors at (x, y) with the same components as before, their scalar 
product with respect to the metric (3.7) is A{x,y)vxWx + 2B{x,y)vxWy -f C(x,y)vyWy. 
Again, the generalization of these considerations to is straightforward: in terms of 
Cartesian coordinates Xi,...,Xd, the most general Riemannian metric on takes the 
form gij{xi, ...,Xd)dxidxj (with implicit summation over i,j = l,...,d), where {gij) is a 
symmetric, positive-definite matrix at each point. 

3.1.3 Angles and conformal transformations 

Metrics can be used to define norms and angles on tangent spaces of a manifold. Indeed, 
suppose we are given a manifold Ai endowed with a metric g. Let p be a point in Ai 
and let n be a tangent vector of Ai at p. Then, the norm of v is naturally defined to be 
||n|| = [pp(n,n)]^'^^. Furthermore, if v and w are two vectors at p, the angle 9 between 
them is defined (up to a sign) by 


cos 9 = 


9p{v,w) 

i|ni| ■ \\w\ 


Note that this definition is blind to the local normalization of the metric. Indeed, suppose 
we define two metrics g and g' on Ai, such that 


{9')p = ^(p) ■ 9p VpeAf, 

where is some smooth, positive real function on Ad. In other words, let us assume that 
g and g' are proportional, the proportionality factor being position-dependent. Then, 
these two metrics define the same angles. The proof is elementary: if v and w are two 
vectors at p G Ai, then the cosine of the angle between these vectors is 

9p(^,w) _ _ Vt{p)gp{v,w) __ 9'p{'v,w) 

[9p{v, v) gp{w, w)]^/^ [n{p)gp{v, v) n{p)gp{w, [g'^{v, v) g'^iw, w)] ’ 

which is obviously independent of whether we choose to use the metric g or the metric 
g' . This observation will be crucial in the following pages. 


Given a manifold Ai, it is natural to wonder what modifications Ai may undergo, 
such that these modifications “preserve the structure” of Ad. To answer this question, we 
must specify precisely what is the structure we wish to preserve. Clearly, a first feature 
we would like to preserve when deforming Ad is its local Euclidean structure. This leads 
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to the notion of diffeomorphisms: by definition, a diffeoniorphisni of a manifold is a 
smooth, invertible map 0 : —?■ A1 such that the inverse map 0“^ be smooth as well'. 
In this sentence, the word “smooth” means “that preserves the local Euclidean structure 
in a continuous and differentiable way”. In heuristic terms, a diffeomorphism of A1 is a 
smooth, invertible deformation of Ai when the latter is seen as a rubber space. 



Figure 6: A diffeomorphism of the sphere. The arrows represent how points of the sphere 
move as the diffeomorphism is applied. Under the action of the diffeomorphism, points are 
shuffled, shapes are distorted, but the motion is smooth, preserving the local Euclidean 
structure of the manifold. 


Suppose now we pick a manifold Ai endowed with a metric g, and consider a diffeo¬ 
morphism (j) : Ai ^ Ai of that manifold. Since the diffeomorphism is a deformation of 
Ai, it will in general affect distances and angles on AT; in other words, a general diffeo¬ 
morphism does not preserve the metric on Ai and maps the original metric g on some 
new metric g'. (In precise terms, what we call the transformed metric is the pull-back 
of g by 0, that is, g' = (p*g.) This gives a motivation for defining certain subclasses of 
diffeomorphisms that preserve some part (or the entirety) of the metric structure, i.e. dif¬ 
feomorphisms for which the new metric g' has certain properties in common with the first 
metric g. 

Definition. A conformal transformation of AT is a diffeomorphism 0 : AT —?■ AT such 
that the original metric g and the transformed metric g' define the same angles (possibly 
up to signs). 

Given the property, shown above, that proportional metrics define identical angles 
(possibly up to signs), it is easy to write down an explicit formula for what we mean by 
a conformal transformation: it is a diffeomorphism for which the transformed metric g' 
is related to the original metric g as 

a'p = ^{p)av Vp e AT, (3.8) 

where G is some smooth, positive function on AT. When VL{p) = 1 for all p in AT, we say 
that the diffeomorphism 0 is an isometry: it preserves not only the angles, but also the 
norms defined by the metric g. Of course, conformal transformations and isometries can 
also be defined for pseudo-Riemannian metrics. 

diffeomorphism is thus a smooth generalization of the notion of homeomorphism, the word 
“smooth” replacing the word “continuous”. 
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Remark. We are now equipped with the tools needed to restate in differential-geometric 
terms the definition of the Lorentz and Poincare groups, originally described in subsection 
1.2. Namely, define d-dimensional Minkowski space to be the manifold endowed with 
a pseudo-Riemannian metric g such that there exist global coordinates (x^) on in 
which the metric takes the form 

9 = = —{dx^Y + dx^dx\ z = 1,..., d — 1, (3.9) 

with the Minkowski metric matrix written in (1.5) for the case d = A. In the 

language of subsection 1.1, the coordinates (x^) are those of an inertial frame. Then the 
isometry group of this manifold is precisely the Poincare group in d dimensions, acting on 
according to (1.8), and the stabilizer for this action is the Lorentz group 0(d — 1,1). 
From this viewpoint, the property (1.6) of invariance of the interval is simply the defining 
criterion for the transformation to be an isometry. 


3.2 Conformal transformations of the plane 

To illustrate the definition of conformal transformations in the simplest possible case, 
let us consider the plane endowed with the Euclidean metric (3.3). To make things 
technically simpler, we see as the complex plane C and introduce a complex coordinate 
z = X + iy^ m terms of which the metric (3.3) becomes g = dzdz (with z the complex 
conjugate of z). Then a generic diffeomorphism is a map 

0 : C —)■ C : 2 : H-)■ Z(z, f), (3.10) 

where the function Z generally depends on both z and Demanding that 0 be a con¬ 
formal transformation imposes certain restrictions on this function, which we now work 
out. 


Since 0 maps z on Z and since the metric g is just dzdz, it is natural that the 
transformed metric be 

g'^ = dZdZ\^, (3.11) 

where the subscript 2 ; means that both sides are evaluated at the point 2 ;. (This is just 
the definition g' = (jfg applied to (3.10).) Here the differential of Z is 

dZ ^ dZ ^ 

dz L = -^dz -I- -T^dz. 
oz oz 

Plugging this expression (and its complex conjugate) in (3.11), we find 


9 


! 

Z 


dZ dZ 
dz dz 


dz -|- 


dZ dZ 
dz dz 


dz -\- 


dZ dZ dZ dZ 
dz dz ^ dz dz 


dzdz. 


According to the definition surrounding eq. (3.8), requiring that 0 be a conformal trans¬ 
formation amounts to demanding that this expression be proportional to g^ = dzdz. The 
terms involving dz"^ or dz^ must therefore vanish, which is the case if and only if 


dZ 

dz 


0 or 


dZ 

dz 


0 . 


(3.12) 


In other words, the function Z must depend either only on z, or only on z. The latter 
possibility represents conformal transformations that change the orientation of (they 
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map an angle 6 on an angle —0), and we will discard them from now on. Thns, a diffeo- 
morphism (3.10) is an orientation-preserving conformal transformation of provided Z 
is a fnnction of z only, that is, a meromorphic fnnction. Fnrthermore, locally, any snch 
fnnction is admissible®. 

Of conrse, this is not the end of the story since (3.10) mnst be a smooth bijection. 
This restricts the form of Z{z) even fnrther. To begin with, Z{z) mnst be regnlar, so 
Z{z) mnst be an analytic fnnction 

Z{z) = A + Bz + Cz"^ + Dz^ -!-•••. (3.13) 

The zeros of Z{z) are the points that are mapped on the origin Z = 0. Since Z{z) mnst be 
an injective map, there can be only one snch zero, say 2 ;*. If this zero is degenerate, then 
the map Z{z) will not be injective in a neighbonrhood of that zero. (If 2 ; is snfhciently 
close to 2 ;* and if 2 ;* is a zero of Z(z) with order n > 1 , then 2 ; is mapped by Z on n different 
points, and Z{z) cannot be injective.) Thns, in (3.13), the coefficients of all powers of 
2 ; higher than one mnst vanish, i.e. C = D = 0, etc. In other words, the fnnction Z{z) 
mnst be linear in 2 ;. Finally, reqniring Z{z) to be snrjective imposes that the coefficient 
of the z-linear term be non-zero. We conclnde that all conformal transformations of the 
plane are of the form 

Z{z) = az + b, with a, b G C and a 7 ^ 0. (3-14) 

These transformations natnrally split in three classes: 

Translations z ^ z + b, 6 G C; 

Rotations z h-)■ e^^z, 6 * G M; 

Dilations z 1 —)■ e~^z, x G M. 

We will see in the next snbsection that these transformations may also be seen as (a 
snbclass of) conformal transformations of the sphere. 

Before going fnrther, let ns note one important detail: in deriving the set of conformal 
transformations (3.14), the fact that the metric g on C was the Enclidean metric (3.3) 
played a minor role. Indeed, we wonid have obtained the exact same set of transformations 
for any metric of the form f 2 (z, z)dzdz on the plane, since conformal transformations are 
blind to the mnltiplication of metrics by (positive) fnnctions. The only crncial point 
was that the metric be proportional to dzdz, since it is this property that led to the 
condition (3.12). The fnrther restrictions leading to (3.14), on the other hand, originated 
from topological (hence metric-independent) considerations. These observations will be 
essential in the following snbsection. 

3.3 Conformal transformations of the sphere 

We now tnrn to the main goal of this section: the classihcation of conformal transforma¬ 
tions of the sphere By dehnition, the latter is a two-dimensional manifold consisting 
of all points with Cartesian coordinates (xi, 0 : 2 , xs) in snch that x{ + X 2 + = r^, 

where r is some hxed (positive) radins. (The notation is nsnally reserved for the nnit 
sphere, with radins r = 1 , bnt here we will denote any sphere by regardless of its 
radins.) 

®Upon writing z = x -\- iy and Z{z^ z) = (a;, y) + lY{x, y) where X and Y are real functions on the 

plane, the first equation in (3.12) can be rewritten as the two Cauchy-Riemann equations for X and Y. 
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3.3.1 Stereographic coordinates 

The standard way to locate points on a sphere of radius r relies on polar coordinates 
6 G [0, tt] and (p G [0, 27r) defined by 

{ Xi = r sin 9 cos p, 

X2 = rsin^sin^j, 
x^ = r cos 9 

for any point {xi,X 2 ,X 3 ) belonging to the sphere. 



Figure 7: A sphere embedded in and the polar coordinates 9, p used to label its 
points. 


In the present case, however, it will be more convenient to use so-called stereographic 
coordinates, which will simplify the treatment of conformal transformations. These co¬ 
ordinates are defined as follows. Consider a point {xi,X 2 ,X 2 ) on the sphere, different 
from the south pole (0,0,—r). Then, there exists a unique straight line in passing 
through that point and the south pole. Explicitly, all points belonging to this line have 
coordinates (|/i, |/ 2 , 2 / 3 ) of the form 


yi = txi, 

92 = tX2, 

2/3 = t{xs + r)-r 


(3.15) 


where f is a parameter running over all real values. (The point corresponding to f = 0 
is the south pole, while t = 1 corresponds to {xi,X 2 ,Xs).) The straight line so obtained 
crosses the equatorial plane {{xi,X 2 , 0)|a;i, X 2 G M} at exactly one point, called the stereo¬ 
graphic projection of {x\, X 2 , X 3 ) through the south pole. The coordinates {x[,X 2 , 0) of this 
projection are obtained by setting 2/3 = 0 in eq. (3.15), that is, by taking t = r/{r -\- X 3 ), 
which gives 


rxi , r X 2 

-, Xn = -. 

r + X 3 r -f- X 3 


(3.16) 


We will refer to x\ and x '2 as the stereographic coordinates on the sphere. They can be 
combined into a single complex coordinate 


x\ -f ix '2 X\ -f iX 2 


r -F X3 


(3,17) 
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which is related to polar coordinates through 


z = e*‘^tan( 6 */ 2 ). 


(3.18) 


For future reference, note that the inverse of relation (3.17) gives {xi,X 2 ,X 3 ) in terms of 
z and z as 


Z Z f ^ /C' L , . 

Xi = r— —z, X 2 = -— —z, X 3 = r——z, (3.19) 

1 + ZZ I 1 + zz 1 + zz 

where we used the fact that Xi + X 2 + x"^ = r^. Of course, we could have carried out a 

parallel construction by projecting points of the sphere on the equatorial plane through 

the north pole; this would have given formulas analogous to (3.16) and (3.17), but with 
r + X 3 replaced by r — X 3 . 


r z — z 


1 — zz 



Figure 8 : The stereographic projection of a sphere centered at O through the south pole 
S. The two red crosses are points belonging to the sphere; the projection maps those 
points on the two red dots on the equatorial plane, following straight lines parametrized 
by eq. (3.15). The coordinates (xj, 0 : 2 , 0) of the projection of a point (xi, X 2 , X 3 ) are given 
by (3.16). 


The stereographic projection is a concrete illustration of the fact that a sphere is 
locally the same as a plane: any point on the sphere, other than the south pole, can be 
projected to the equatorial plane through the south pole. Points that are close to the 
north pole get projected near the origin z = 0] the whole northern hemisphere is projected 
in the unit disc \z\ < 1 , and the equator is left fixed by the projection, corresponding to 
the unit circle \z\ = 1. Points belonging to the southern hemisphere, on the other hand, 
are projected outside of the unit disc. In particular, points located near the south pole are 
projected far from the origin, at large values of |^|: as points get closer to the south pole, 
they get projected further and further away. In fact, one may view the infinitely remote 
point on the plane, the “point at infinity” z = 00 , as the projection of the south pole itself. 
(Of course, the actual projection of the south pole is ill-defined, so the point at infinity 
does not have a well-defined argument.) We conclude that the sphere is diffeomorphic to 
a plane, up to a point. More precisely, 

= C U {point at infinity} = CU {z = 00 }. (3.20) 

The representation of the sphere as a plane to which one adds the point at infinity is 
called the Riemann sphere [43]. This relation hints that some of the results derived above 
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for conformal transformations of the plane should be applicable to the sphere as well. In 
order to see concretely if this is the case, we first need to express the metric of a sphere 
in terms of the coordinate 2 ;. 


3.3.2 The metric on a sphere in stereographic coordinates 

The natural metric on a sphere follows from the definition of a sphere as a submanifold 
of Namely, endowing with the Euclidean metric dx^ + + dx"^, the metric on 

the sphere is simply 

g = {dxj + dxl + dxl) . (3.21) 

To express this metric in terms of stereographic coordinates, we use formula (3.17), from 
which it follows that the differential of 2 ; is 

_ {dxi + idx 2 ){r + X 3 ) - {xi + ix 2 )dx 3 
{r + x3y • 

On the sphere defined by x^ + xl + xl = r^, the differentials of Xi, X 2 and 0:3 satisfy the 
relation xidxi + X 2 dx 2 + = 0 , which can then be used to show that 


dzdz 


(r + xs)^ 


[dxl + dxl + dxl) I 


xf+X2+x'^=r‘^ 


In the last term of this expression we recognize the metric (3.21) on the sphere, whose 
expression in terms of z thus becomes 


gz = {r + X3Ydzdz 


4^2 

(1 + zzy 


dzdz, 


(3.22) 


where we used the third relation of (3.19) to write X 3 as a function of z and z. This metric 
is position-dependent, since it explicitly depends on 2 ;. In fact, up to the factor 4r^, it 
is precisely the metric (3.6) that we took as an example earlier on, written in terms of 
z = X + iy. The only subtlety is that, in contrast to (3.6) where x and y only take finite 
values, expression (3.22) must be understood as a metric on the Riemann sphere, where 
1^1 may be infinite. 


Crucially, the metric (3.22) is proportional to the Euclidean metric dzdz, which im¬ 
plies that, as far as conformal transformations are concerned, we can simply repeat the 
derivation carried out in subsection 3.2 for the plane. More precisely, if we demand that 
a diffeomorphism 0 : C U {^ = 00 } —C U {z = 00 } : z ^ Z{z,z) he a conformal 
transformation, the arguments that led to (3.12) remain true and the function Z must 
depend either only on z, or only on 0 . The latter choice corresponds to transformations 
that do not preserve the orientation of the sphere, so we will ignore them. Thus, any 
orientation-preserving conformal transformation of the sphere is a meromorphic function 
of the form z Z{z), and locally on the sphere this is all we can say. 


Globally, of course, this is not yet the end of the story, since we must further require 
that the function Z{z) be a diffeomorphism of the sphere — that is, a diffeomorphism 
of the plane C with the point at infinity added as in (3.20). This point will play a key 
role. Indeed, requiring that Z{z) be regular on C U {z = cx)} no longer means that Z{z) 
is analytic as in (3.13); rather, Z{z) now may (and should) have at least one pole, at z* 
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say, corresponding to the point that is mapped to the south pole Z(z*) = oo. Thus, Z(z) 
should now be a rational function of the general form 

A + Bz + Cz^ + Bz^ + --- 
“ A' + B'z + C'z^ + D'z3 + • • • ’ 

where the roots of the numerator (resp. denominator) correspond to the points that are 
mapped on the origin Z = 0 (resp. the point at infinity Z = oo), i.e. on the north pole 
(resp. the south pole). Since Z(z) must be an injective map, there must be one, and 
only one, point that is mapped to the north pole, and also exactly one other point that is 
mapped to the south pole. As in subsection 3.2, this requires that both the numerator and 
the denominator be linear functions of ; 2 . We can thus write any orientation-preserving 
conformal transformation of the Riemann sphere as 

(3.23) 

where a, b, c and d are complex numbers. Requiring this map to be surjective finally 
imposes that 

(c d) ^ 

This is the classification of conformal transformations of the sphere that we were 
looking for. Such transformations are also called Mobius transformations. They obviously 
contain the set of conformal mappings (3.14) of the plane, so that translations of z, 
rotations and dilations also represent conformal transformations of the sphere. However, 
there is now an additional two-parameter family of transformations of the form 

z !-)■ —b^fz, b G C*, 

corresponding to so-called special conformal transformations [11,12]. Such transforma¬ 
tions map the north pole on the south pole, and vice-versa. Any conformal transformation 
of the sphere can be obtained as the composition of a special conformal transformation, 
a translation, a rotation and a dilation (possibly in a different order). 




az + b 
cz -\- d' 


By construction, conformal transformations span a group, so it is worthwile to inves¬ 
tigate the group structure of the set of Mobius transformations. Clearly, formula (3.23) 
is blind to the overall normalization of the matrix in (3.24), since multiplying all entries 
of the matrix by the same non-zero complex number leads to the same transformation 
(3.23). We can thus assume, without loss of generality, that the non-zero determinant 

'a b^ 

A 

matrices in SL(2,C) that differ only by their sign define the same conformal transfor¬ 
mation, so the group of all non-degenerate transformations of the form (3.23) is actually 
isomorphic to the quotient 

SL(2,C)/Z2. (3.25) 


(3.24) is actually one, i.e. that the matrix 


belongs to SL(2, C). Furthermore, two 


In other words, according to (2.22), the set of orientation-preserving conformal transfor¬ 
mations of the sphere forms a group isomorphic to the connected Lorentz group in four 
dimensions. At this stage, this relation appears just as a coincidence of group theory: 
there seems to be no relation whatsoever between the Mobius transformations (3.23) and 
the original definition of the Lorentz group as a matrix group acting on The purpose 
of the next section will be to show that this apparent coincidence actually has a geometric 
origin, rooted in the structure of light-like straight lines in Minkowski space-time. 
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3.4 An aside: conformal field theories in two dimensions 

In the two previous subsections we have seen that any (orientation-preserving) conformal 
transformation of a two-dimensional manifold with a conformally flat metric oc dzdz can 
be written as a meromorphic function z ^ Z{z). Demanding that Z{z) be a bijection of 
the manifold imposes certain restrictions on the function Z, leading to (3.14) in the case 
of the plane, and (3.23) in the case of the sphere. However, in physical applications, it 
is often the case that “global” requirements such as bijectivity play a minor role. This is 
particularly true in the case of local quantum held theories®, whose properties are mostly 
determined by local (as opposed to global) considerations. 

This feature is of central importance in the context of conformal field theories in two 
dimensions [11,12]. By dehnition, a conformal held theory in d dimensions is a quantum 
held theory, dehned on a d-dimensional manifold A4 endowed with some metric that 
is invariant under conformal transformations of Al. In the case Ai = 8“^, with a metric 
proportional to dzdz in terms of stereographic coordinates, this leads to theories that are 
invariant under all Mobius transformations (3.23). However, the actual set of inhnitesimal 
symmetries of such theories {i.e. symmetries found without taking global issues into 
account) turns out to be much, much larger than the hnite-dimensional group (3.25). 
Indeed, since global requirements such as bijectivity play a secondary role, conformal 
held theories in two dimensions turn out to be invariant under all transformations that 
can be written locally as z Z{z), where Z{z) is any meromorphic function^®. This leads 
to an inhnite-dimensional symmetry algebra that constrains such theories in a extremely 
powerful way [46]. For instance, when combined with an additional symmetry property 
called “modular invariance”, conformal invariance of a two-dimensional held theory implies 
a universal formula for the entropy of that theory, known as the Cardy formula [47]. We 
will briehy return to conformal held theories in the conclusion of these notes. 


4 Lorentz group and celestial spheres 

So far we have seen that the connected Lorentz group in four dimensions, = SO(3,1)^, 
is isomorphic to the quotient SL(2 ,C)/Z2 . We have also shown that the latter arises as 
the group of orientation-preserving conformal transformations of the sphere. However, 
at this stage, the relation between the Lorentz group and the sphere appears as a mere 
coincidence. In particular, since the original Lorentz group is dehned by its linear action 
on a four-dimensional space, there is no reason for it to have anything to do with cer¬ 
tain non-linear transformations of a two-dimensional manifold such as the sphere. The 
purpose of this section is to establish this missing link. This will require hrst dehning a 
notion of “celestial spheres” in Minkowski space-time (subsection 4.1), and then comput¬ 
ing the action of Lorentz transformations on such spheres (subsection 4.2). Subsection 
4.3 is devoted to the analysis of the somewhat counterintuitive action of Lorentz boosts 
in terms of celestial spheres. Our approach is motivated by the notion of “asymptotic 
symmetries” in gravity [14], and will rely on a specihc choice of coordinates that sim- 

®We will not explain the meaning of “quantum field theory” here. For an introduction, we refer for 
instance to the textbooks [28,44]. 

^*^At this point we should mention that proving conformal invariance of a quantum theory may be a 
subtle issue when the curvature of the underlying manifold does not vanish, due to the Weyl anomaly 
[11,45]. We will not discuss these subtleties here. 
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plifies the description of null infinity in Minkowski space-time. The results as such are 
well known, and coordinate-independent — see for instance [3,4]. It should be noted 
that similar relations exists also in other space-time dimensions. For instance, in d = 3 
dimensions, the connected Lorentz group SO(2,1)"'^ acts on the celestial circles at null 
inhnity through projective transformations spanning a group SL( 2 ,M)/Z 2 , in accordance 
with the isomorphism (2.12). In this section, however, we will restrict our attention to 
the four-dimensional case. 

4.1 Notion of celestial spheres 

As explained in section 1, inertial observers in special relativity live in Minkowski space- 
time, which may be seen as the vector space Inertial coordinates consist of one time 
coordinate t or = ct, and three Cartesian space coordinates = (x*). (Latin 

indices run over the values 1, 2, 3.) Given such coordinates, there is a natural way to 
define a corresponding family of spheres. Namely, one may describe the spatial location 
of an event in terms of spherical, rather than Cartesian, coordinates, dehned as 

r = Wix^y -f ix'^y + ix'^Y = \/ x^x^ and z = ^ , (4.1) 

r -f 

where points on the sphere of radius r are labelled by the stereographic coordinates 
(3.17). In particular, the spatial coordinates x* = Xi take the form (3.19) when expressed 
in terms of z and Note that the parity transformation dehned by the matrix (1.16) 
acts on the coordinate 2 ; according to z ^ —1/z. 

For each non-zero r, we thus have a spatial sphere naturally associated with the in¬ 
ertial coordinates (x°, a:^). Since Lorentz transformations relate different sets of 

inertial coordinates through linear transformations x h-)■ a;' = A • a:, one might hope that 
the rewriting of these transformations in terms of spherical coordinates (4.1) could give 
rise to a “nice” action of the Lorentz group on spatial spheres, one that would make the 
relation to conformal transformations more apparent. This is not quite the case, how¬ 
ever; roughly speaking, the “celestial sphere” that we actually wish to define should be the 
sphere that an inertial observer looks at. This is not achieved by the sphere of radius r 
dehned by (4.1), because radial, ingoing light-rays emitted by the sphere need a non-zero 
time r/c to get from the sphere to the origin at r = 0 (which we take to be the position 
of the observer). Thus, we need to work a little more: we must somehow combine space 
and time coordinates so as to take into account the hnite velocity of light, and dehne the 
celestial sphere seen by an observer at some moment of time as an object living in the past. 

The argument just outlined hints at the right dehnition of what we would like to call 
a “celestial sphere”. Indeed, consider a radial light ray whose trajectory in space-time is 
described by^^ 

r = To ± ct = To ± 2 ; = constant. 

In particular, an ingoing radial light ray satisfies r = tq — ct where tq is some strictly 
positive initial radius. Along the trajectory of this light ray, the quantity 

u = ct-{-r = x'^ + r (4.2) 

^^Here z is of course the coordinate (4.1) locating points on the sphere, and not the z coordinate of a 
Cartesian coordinate system. 
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is constant; it represents the time at which the observer located at r = 0 sees the light 
ray. One can thus parametrize the time of emission of ingoing radial light rays by the 
value of u. Instead of using coordinates (x°,r, 2 ;) to locate events in space-time, one may 
then use the Bondi coordinates {u, r, z), in which case u plays the role of a time coordinate 
and is called advanced time. The situation can be depicted as follows: 





Figure 9: The Bondi coordinates u and r in Minkowski space-time. The time coordinate 

= ct points upwards. The wavy red line represents an incoming radial light ray, 
emitted from some non-zero distance r towards the observer located at r = 0. The light 
ray moves along one of the generators of the light cone dehned by m = cst. The hgure 
represents three-dimensional space-time, so the circle of radius r in this drawing would 
in fact be a sphere in our actual, four-dimensional, space-time. That sphere is spanned 
by the stereographic coordinate z in (4.1). 

In terms of Bondi coordinates, a sphere at constant r > 0 and constant m G M coincides 
with the sphere seen by an observer sitting at the origin r = 0 at time u. We can then 
dehne the celestial sphere at time u as the sphere located at an inhnite distance, r —>■ -1-cxd, 
and at a hxed value of u. It is the sphere of all directions towards which an observer 
at r = 0 can look [3], the reason for the name “celestial” being obvious in that context. 
(Celestial spheres are also sometimes called “heavenly spheres” [37].) Less obvious is the 
fact that this dehnition is the one needed to match Lorentz transformations and Mobius 
transformations, which will be the purpose of the next subsection. The region M x 
spanned by the coordinates u and 2 ; at r —)• -|-oo is called past null infinity [14]; it consists 
of events located at an inhnite distance from the observer, and it can be reached from the 
line r = 0 by following a past-directed null vector, that is, a vector whose norm squared 
vanishes with respect to the Minkowski metric (3.9). 
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Figure 10; A schematic representation of celestial spheres. As in Fig. 9, the time coordi¬ 
nate points upwards and the wavy red line represents an incoming radial light ray. The 
picture represents three-dimensional space-time. Accordingly, the red circle at the bot¬ 
tom of the image would really be a sphere — a celestial sphere — in our four-dimensional 
space-time. Past null inhnity is the cone on the lower half of the image; it is a manifold 
M X spanned by the advanced time u and the stereographic coordinate z. 


Remark. In these notes we define celestial spheres by using a specihc set of coordinates 
{u, r, z) in Minkowski space-time. There also exists a different definition, according to 
which the celestial sphere associated with a point in space-time is the projective space 
of its (past) light-cone, that is, the set of past-directed null directions passing through 
that point [3]. In such terms, the celestial sphere at time u that we dehned above is the 
set of past-directed null directions through the point with Bondi coordinates (r = 0 ,m). 
(This could be any point in space-time since the Minkowski metric is invariant under 
all space-time translations.) This sphere can be thought of as the complex projective 
line CP^ = S^, and Lorentz transformations span the group SL(2, C)/Z 2 of its projective 
transformations — which are nothing but Mobius transformations when seeing CP^ as 
a sphere 5^ [3]. The advantage of this projective viewpoint is that it is manifestly 
coordinate-independent, but we will not adopt this approach here. (See, however, the 
end of subsection 4.3.) 

4.2 Lorentz transformations acting on celestial spheres 

The Lorentz group is dehned as the set of linear transformations x x' = A - x between 
coordinates of inertial observers in Minkowski space-time. In the previous subsection we 
have introduced new, non-inertial, Bondi coordinates {u, r, z) associated with each choice 
of inertial coordinates (x^). In order to hnd the action of the Lorentz group on Bondi 
coordinates, we must express both x' and x in terms of the associated Bondi coordinates. 
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then rewrite the relation x' = A • x in Bondi coordinates and read off the Lorentz transfor¬ 
mation properties of {u,r,z). Since the relation between inertial coordinates and Bondi 
coordinates is non-linear, this procedure leads in general to cumbersome expressions for 
(«', r', z') in terms of {u, r, z) and of the matrix elements of a Lorentz transformation. 
Fortunately, we are not actually interested in the general relation between («', r', z') and 
{u,r,z), but only in its limit r —)■ -|-cx) with hnite u. Provided Lorentz transformations 
preserve that limit (which is to be expected since they are linear in inertial coordinates), 
keeping u' hnite, they correspond to well-dehned transformations of past null inhnity. 


4.2.1 Transformation of the radial coordinate 

Let us begin by computing the transformation law of the radial coordinate r under Lorentz 
transformations. By dehnition, the (square of the) radial coordinate r' associated with 
the inertial coordinates (x'^) is = (x'^)^ -|- (x'^)^ -|- (x'^)^. If now we assume that 
the coordinates x'^ are obtained by acting on certain coordinates x^ with a Lorentz 
transformation A, we have x'^ = K^yX^ and 

r'2 = (AV + A^x'+AV + Aiax^)' 

+ (A%x° + A^x^ + + A^sx^)^ 

-h (A^o 3^° + A^ix^ + A^gx^)^ . (4.3) 


The next step consists in expressing the coordinates x^ in terms of Bondi coordinates 
{u,r,z) through relations (3.19) and (4.2). Taking the limit r —)■ -|-oo while keeping u 
and ^ hxed, the only terms that survive in the parentheses are those proportional to r, 
which gives 


r '2 = 


a 1 *1 ^ + 2: ^ ^ *1 I - zz 

-A or A^ir- -z K 2-- -z + A or 


1 + zz 


‘ i 1 + zz 


1 + zz 


.2 „ , ^2 „ ^ + ^ , a 2 ^ , a 2 


-f -A^or + A^ir 


1 -F 


+ A^ 2 V 


i 1 + ZZ 


+ AV 


-\ 2 
ZZ 


1 + zz 


, aS + r z - z 

A'* 2 V 


-f -A^or + A^r 


1 + zz 


‘ i 1 + zz 


z + aV 


-\ 2 
ZZ 


1 + zz 


+ 0(r) 


(In particular, in that limit, we may replace x*^ by —r.) Here the terms of order r outside 
the parentheses are subdominant with respect to the terms of order coming from 
the parentheses. As the hnal touch, we take A to be a proper, orthochronous Lorentz 
transformation, i.e. an element of the connected Lorentz group L\_. We can then express 
all entries A^y of the Lorentz matrix in terms of complex numbers a, b, c, d forming a 
matrix in SL(2,C), as in eq. (2.30). Taking the square root to express r' in terms of r. 
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this gives the lenghty relation 


1 r 


r = -- 


{ac + bd + ac + bd) (1 + zz) + {ad + bc + ad + be) {z + z) 


21 + zz 

+ {ad — be — ad + be){z — z) — {de — bd + de — bd){l — zz) 

— (^{ac + bd — de — bd){l + zz) — {dd + be — ad — be) {z + z) 

_ ^ _ \2 

+ {ad — be + ad — be){z — z) + {ae — bd — ae + bd){l — zz )) 

— 1^1 — Ml zz) {ab — ed -\- ab — ed) {z z) 


+ {ab — ed — ab + ed){z — z) — (|ap — |6|^ — |c|^ + Mn(l ~ 


1/2 


+ 0 ( 1 ). 


At first sight, this expression looks terrible: if we were to expand all the products and 
parentheses in such a way that the argument of the square root be a sum of monomials 
in z, z and the numbers a, b, e, d (and their complex conjugates), then the sum would 
contain about 3 x 32! terms. Fortunately, as one can check by a straighforward but 
tedious computation, the terms of the sum conspire to give a very simple hnal answer; 




\az + + \ez + d\" 

1 + zz 


+ 0{l)=r-F{z,z) + 0{l). 


(4.5) 


This result shows that, as expected, Lorentz transformations do not spoil the limit 
r —)■ +CX): the leading effect of Lorentz transformations on r is just an angle-dependent 
rescaling by some function F{z, z). Furthermore, the occurrence of combinations such as 
az + b and ez + d is reminiscent of conformal transformations of the sphere, eq. (3.23). 
The 0{1) terms in (4.5) are subleading corrections that we will not write down, though 
they will play a role in the transformation law of advanced time. 


4.2.2 Transformation of advanced time 

Having derived the transformation law of the radial coordinate r (in the large r limit), 
we now turn to the transformation of the remaining coordinates u and x. We begin with 
the former; using the definition (4.2), we write 

u' = r' + et' = r' + x'^. (4.6) 

We are now supposed to express r' and x'^ in terms of unprimed coordinates using their 
Lorentz transformation laws, then write everything in terms of r, u and x, and read off the 
transformation law of u at r —)■ +cxd. But there is a subtlety in carrying out this procedure. 
Namely, we have just seen that the transformation law of r is r' = F{z,z) ■ r + 0(1); 
when plugged into (4.6), this implies that the transformation law of u should read 

u' = F{z, z) - r + 0(1) + 

Here the leading 0(r) term is dangerous: if there is nothing to cancel it, the limit r —)■ +cx) 
of the transformation law of u will be ill-dehned. The only way to get rid of this term is 


(4.4) 
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to cancel it against the leading 0{r) term in the transformation law of which is given 

by 


= A%a:°+AV 

‘ + |6P + |cp + - r) + ^ 

—(ab + cd — ab — cd)-^ —^ + (|ap — + kp — |rfk):j-^ 

Plngging this in expression (4.6) and nsing (4.5), one sees that the dangerons terms, pro¬ 
portional to r, cancel ont! This means that the limit r — )■ -|-cx) of the transformation law 
of u is well-dehned; in terms of observers, it means that if Alice and Bob are boosted with 
respect to each other and if Alice assigns a hnite valne u of advanced time to some event, 
then Bob will assign to it a Lorentz-transformed valne u' which is also hnite, thongh 
in general different, even if the event is located at an inhnite distance from both Alice 
and Bob. This cancellation of potentially divergent terms in the transformation law of u 
is actnally the very reason why celestial spheres are dehned at null inhnity rather than 
spatial inhnity. (The latter wonld correspond to the limit r —)■ -t-oo with the coordinate 
being kept hnite instead of m, and in that case the large r limit of the transformation 
law of wonld be ill-dehned.) 


— [ah + cd + ah + cd) 


z + z 
1 + zz 


Using the fact that the transformation law of u is well-dehned at r —>■ -|-oo (no 0{r) 
term), we know on dimensional gronnds that 

u' = G{z,z)-u + 0{l/r), (4.7) 

where G{z, z) is some nnknown fnnction. (Indeed, u and r are the only Bondi coordinates 
with dimension of length, so, npon expanding the transformation law of u in powers of r, 
the term of order zero in r mnst be proportional to u.) In particnlar, the ehect of Lorentz 
transformations on advanced time at inhnity is jnst an angle-dependent rescaling, jnst as 
the transformation (4.5) of the radial coordinate. The qnestion, then, is to compnte the 
rescaling G(z, z). 


Jnst as in the case of the radial coordinate, the compntation of the rescaling factor 
is straightforward, bnt cnmbersome. In particnlar, it reqnires evalnating the snbleading 
term in the transformation law of r, which we did not derive in (4.5) as it was inclnded 
in the 0{1) terms. This snbleading term can be fonnd by Taylor-expanding the transfor¬ 
mation law (4.3) of r in powers of 1/r, keeping u hxed. We will not write the details of 
this compntation here, and we simply display the hnal resnlt [14]: 


u 


/ 


u 

cz + dp 


+ 0(l/r). 


This is of conrse of the annonnced form (4.7). 


4.2.3 Transformation of stereographic coordinates 

The last case to be considered — and the most interesting one for onr pnrposes — is the 
transformation law of the z coordinate nnder Lorentz transformations in the large r limit. 
The compntation is more or less straightforward, as it only involves the dominant piece 
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of the transformation law of r, displayed in (4.5). To begin, one nses (4.1) to write the 
transformed coordinate as 

/I • /9 

, X + IX 

^ ! i /3~ ’ 

r + x'-^ 

where the primed coordinates on the right-hand side are obtained by acting with a Lorentz 
transformation A on nnprimed coordinates: 


/ 


F{z, z) ■ r + 


+ 0(l/r). 


(We nsed eq. (4.5) in writing this.) Expressing the x^’s in terms of Bondi coordinates 
throngh relations (3.19) and (4.2) and keeping u hnite, one Ends 


(A^o + *A^o)(l + zz) - (A\ iA‘^i){z + z) + i{h ^2 + 'iA‘^ 2 ){z - z) - (A^s -h iA^3)(l - zz) 
— \az + — \cz -I- dp -I- A3o(l + zz) — A^i(z + z) + iA^ 2 {z — z) — A33(l — zz) 


np to 1/r corrections. Finally, writing the entries A^j, of the Lorentz matrix as in 
eq. (2.30), both the nnmerator and the denominator of the last expression become certain 
complicated polynomials in a, b, c, d, z, and their complex conjngates. Fortnnately, many 
terms in these polynomials cancel against each other, leading to a simple expression of z' 
in terms of 2:: 

(4.8) 

This is precisely the standard expression of Mobins transformations, eq. (3.23): Lorentz 
transformations coincide with conformal transformations of the celestial sphere. This is 
the resnlt we wanted to prove. 


z = 


az + b 
cz + d 


C>(l/r). 


Remark. In deriving (4.8), onr choices of conventions played an important role. In¬ 
deed, we conld have dehned the homomorphism / : SL(2,C) —)■ of snbsection 2.3 

by acting on Hermitian matrices of the form (2.23), bnt with different signs in front of 
the components x^. (The standard choice [5] wonld correspond to changing the sign in 
front of x^.) This wonld have led to a different expression of the homomorphism (2.30), 
which in tnrn wonld have given a different formnla for the action of Lorentz transfor¬ 
mations on celestial spheres. For instance, the terms az + b and cz + d wonld then be 
replaced by combinations snch as az — b and —cz + d, or az + b and cz + d, or other 
variations on the same theme. Bnt the statement that Lorentz transformations act as 
conformal transformations on celestial spheres remains trne regardless of one’s choices of 
conventions. Fnrthermore, the physical effect of snch conformal transformations is also 
convention-independent; we will see an illnstration of snch a physical (actnally, optical) 
effect in the next snbsection. 


4.3 Boosts and optics 

It is worthwile to analyse the transformation law (4.8) for certain specific examples of 
Lorentz transformations. Namely, recall that the homomorphism (2.30) allowed ns to 
represent rotations and boosts by the SL(2,C) matrices (2.31) and (2.32), respectively. 
We can then ping these matrices in eq. (4.8) and interpret the resnlting formnla as the 
conformal transformation of the celestial sphere that corresponds to a change of frames 
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between two inertial observers, say Alice and Bob. For instance, a rotation by 6 along 
Alice’s axis corresponds to a rotation of the sphere represented hy z ^ e~^^z. This is 
not surprising: if the frames of Alice and Bob are rotated with respect to each other, it 
is obvious that their respective celestial spheres will be identical, up to a rotation. 

4.3.1 Boosts, optics and the Millenium Falcon 

A more interesting phenomenon occurs when Bob is boosted with respect to Alice, with 
rapidity y say. The stereographic coordinate z' of the celestial sphere seen by Bob is then 
related to the coordinate z of the sphere seen by Alice according to 

z' = e~^z. (4.9) 

Let us take y > 0 for dehniteness, he. let us assume that Bob moves in the direction of 
positive ^ towards the north pole of the sphere, located at z = 0. Then eq. (4.9) tells 
us that, although Alice and Bob are looking at the same celestial sphere, the points of 
Bob’s sphere are all pulled closer to the north pole than the points of Alice’s sphere. If 
we imagine that shining stars are glued to the celestial sphere, then the stars seen by Bob 
are grouped closer to the north pole (he. closer to the direction of Bob’s motion) than 
those seen by Alice. 



Figure 11: The conformal transformation of the celestial sphere corresponding to a boost 
towards the north pole: all points of the sphere are dragged along the arrows, towards 
the north pole. Equivalently, all points are dragged away from the south pole (which is 
not visible in this picture). In terms of stereographic coordinates obtained by projection 
through the south pole, this transformation corresponds to a contraction of the Riemann 
sphere, 2 ; h->• e~^z with y > 0. 


This result is somewhat counterintuitive, if we base our intuition on our habit of 
objects flowing past us when driving on the highway. Roughly speaking, one would expect 
that boosting in a given direction should make objects spread away from that direction. 
This intuition is well illustrated in the movie Star Wars Episode IV: A New Hope [48] . In 
the screenshot reproduced below, Han Solo and Chewbacca are sitting in the cockpit of 
the Millenium Falcon spaceship and have just switched on the “hyperspace” mode — they 
are accelerating straight ahead. This acceleration corresponds to a continuous family of 
boosts in the direction of the acceleration. In the picture, these boosts are represented 
by stars flowing away from the direction of the motion, exactly as dictated by the naive, 
intuitive expectation just described: 
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Formula (4.9) tells us that this representation of the “jump to lightspeed” is wrong: 
what Han Solo and Chewbacca should really see is a contraction of the sphere at which 
they are looking, towards the direction of their acceleration. In other words, as long as 
the stars are far enough from the observer undergoing a boost, they should cluster close 
to the direction of the boost rather than flow away from it. 

There is of course a subtlety in this argument: our intuition of objects flowing past 
us when we drive on the highway is obviously correct, so how come it contradicts the 
result (4.9)? The answer is that formula (4.9) holds only in the limit r —>■ +oo, when the 
sphere we are talking about is located at an inhnite distance from the observer. In that 
limit, the observer’s motion does not affect its distance to a point on the celestial sphere; 
in particular, all points on the sphere remain at an inhnite distance from the observer, 
and there is no way they could how past him. In real-world applications, however, all 
objects are necessarily located at some finite distance, in which case the corrections of 
order 1/r neglected in (4.8) become relevant. In particular, when Bob is moving with 
respect to Alice, the relation between his Bondi coordinates and those of Alice involves 
some time-dependent factors in the 1/r corrections. These corrections imply that the 
objects seen by Bob (be it stars, or cows on the side of the highway) do indeed how past 
him when he gets close enough to them. In this sense the picture of the Millenium Falcon 
cockpit shown above is not completely wrong. Still, for stars located sufficiently far from 
Han Solo and Chewbacca, the 1/r corrections in (4.8) are negligible and the optical ehect 
described by the contraction (4.9) is valid. 

4.3.2 Subtleties 

While 1/r corrections are the most obvious source of modihcations to the result (4.9), 
there are several other caveats in trying to apply Lorentz transformations to realistic 
situations such as the jump to lightspeed in the Millenium Falcon. The hrst is the fact 
that the motion of the spaceship during the jump is actually accelerated, so that Han 
Solo and Chewbacca are dehnitely not inertial observers! This does not prevent us from 
guessing what should happen: roughly speaking, accelerated motion may be seen as an 
inhnite sequence of inhnitesimal boosts, so if (4.9) remains valid for each inhnitesimal 
boost, one expects the celestial sphere seen by an accelerated observer to undergo a time- 
dependent contraction (in the direction of acceleration), with a scaling factor that gets 
smaller and smaller as time goes by. More precisely, since rapidity is the integral (1.23) 
of proper acceleration, the naive application of (4.9) to accelerated motion predicts that 


48 







an accelerated observer, looking at the celestial sphere in the direction of his acceleration, 
should see a proper-tinie-dependent contraction with a scaling factor exp[—/(s)/c], where 
I{s) is the integral (1-21) of proper acceleration over proper time. 

The potential problem with this expectation is that special relativity was established 
for inertial observers from the very beginning, so one might fear that acceleration in¬ 
validates the application of special-relativistic techniques to the Millenium Falcon. For¬ 
tunately, there is in principle no obstacle in describing accelerated observers in special 
relativity [30]. For example, Thomas precession is a well known special-relativistic effect 
that applies to such observers [30,49], and it is precisely derived by thinking of acceler¬ 
ated motion as a sequence of inhnitesimal boosts. The only issue is that the reference 
frames associated with accelerated observers^^ are not global coordinate systems — they 
do not cover the whole of space-time. This is related to the existence of horizons: for 
instance, a uniformly accelerated observer in Minkowski space-time — a Rindler observer 
— cannot receive light rays coming from behind his future horizon [30]. Thus, since our 
dehnition of celestial spheres relied on the limit r —)■ -|-cx) in Bondi coordinates, one may 
wonder whether acceleration invalidates our approach, as the limit may be ill-defined. 
We will not attempt to address this issue here, but we will rederive formula (4.9) in a 
local way at the end of this section. This will conhrm that the optical effects of boosts 
on the celestial sphere do not actually rely on a large r limit, as already mentioned at 
the end of subsection 4.1. In particular, the local nature of the derivation implies that it 
remains valid for an accelerated observer in the sense that acceleration deforms the shape 
of light-cones centered on the observer — as is of course well-known in general relativity. 
Whether this deformation can be seen by an “asymptotic” computation analogous to the 
one explained above is another matter, which we will not discuss. 

A second subtlety to be considered is the fact that the light seen by Han Solo and 
Chewbacca during the jump to lightspeed is blue-shifted due to the Doppler effect. As 
the velocity of the Millenium Falcon increases, the frequency of the light rays hitting 
the observers inside the cockpit increases as well. Eventually, the increase in frequency 
should become so high that the stars actually become invisible — the starlight seen by 
the pilots of the spaceship has reached the ultraviolet region. Thus, the stars seen by 
Han Solo and Chewbacca not only move in the direction of acceleration, but they also 
change colour, becoming blue, then purple, then invisible^^. 

This blue shift applies of course to any electromagnetic radiation reaching the ob¬ 
servers inside the cockpit. In particular, it applies to the cosmic microwave background 
radiation^'^. Thus, at sufficiently high velocities, the background radiation should reach 
the visible spectrum and the actual picture seen from the cockpit of the Millenium Falcon 
should include a fuzzy disc of light centered around the direction of the motion [50,51]. 
Upon taking this effect into account and recalling that most stars become invisible be¬ 
cause of the blue shift, one concludes that the actual landscape seen by Han Solo and his 

^^Such reference frames are usually defined by attaching a Fermi-Walker transported tetrad to the 
world-line of the observer, then using this tetrad to define a local coordinate system; see [30], chap. 6. 

^^Strictly speaking, they become invisible to a human eye — to the best of our knowledge, it is not 
known whether Wookies are able to see a broader spectrum of electromagnetic radiation than human 
beings: while the stars definitely become invisible to Han Solo, they might still be visible to Chewbacca. 

^^Here we are assuming that the Star Wars took place in a universe that started off with a Big Bang. 
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hairy companion is indeed far, far away from the image shown in the movie. 


4.3.3 A local derivation 

We have just seen that boosting an observer in a given direction affects his celestial sphere 
by contracting all points of the sphere towards that direction. Given the counterintuitive 
nature of this optical phenomenon, it is worthwile to rederive it using a different tech¬ 
nique. Namely, consider two inertial observers, Alice and Bob, using inertial coordinates 
(x^) and {x'^) respectively. We take Bob’s coordinates to be boosted, with rapidity y, 
with respect to those of Alice. For dehniteness, we will assume that the boost takes place 
along the x^ direction, so that the relation between Bob’s coordinates and Alice’s coor¬ 
dinates is x'^ = K^vX'^, with A the matrix (1.19). Now suppose Alice and Bob both see 
one incoming photon, whose energy-momentum vector is p = {E,—E cos9,—E 
in Alice’s coordinates, and p' = {E', —E' cos6', —E'sm6',0) in Bob’s coordinates. (Here 
E and E' are the photon’s energy in Alice’s and Bob’s frames, respectively.) The angle 
9 (resp. 9’) is the angle between the photon’s direction and the axis x^ = x'^ in Alice’s 
(resp. Bob’s) frame. The question is: what is the relation between 9' and 91 


Since 4-momentum transforms under boosts just as standard inertial coordinates do 
(the energy-momentum vector is a “four-vector”), we know that the photon’s 4-momentum 
in Bob’s and Alice’s coordinate systems are related by p' = A ■ p. Explicitly, this means 


that 

( E' \ 

—E' cos 6*' 
-E' sin 9' 

\ 0 ; 

From this we read off tan^' 
terms of half angles as 


/ cosh y — sinh y 0 0\ 
— sinh y cosh y 0 0 
0 0 10 

\ 0 0 0 1 / 

sin0/(sinhy -|- coshy cos^ 


( E \ 

—E cos 9 
—E sin 9 
\ 0 / 

, which can be rewritten in 


tan(6''/2) 


1 

tan(6*'/2) 


e ^tan(6'/2) 


1 

e~^ tan(6'/2) 


This is a quadratic equation for tan(6*Y2) as a function of tan(0/2). The solution that 
ensures 9' = 9 when y = 0 is the simplest one. 


tan(0Y2) = e ^tan(6'/2). 


(4,10) 


Since here 9 and 9' should be thought of as standard azimuthal coordinates on the sphere 
in unprimed and primed coordinate systems, we can relate them to the stereographic co¬ 
ordinate z through relation (3.18). The result (4.10) thus coincides with the contraction 
(4.9), as it should. In particular, provided 9 is in the hrst quadrant (between 0 and vr/2), 
9' is smaller than 9 when the rapidity y is positive. (Conversely, when 9 is larger than 
7 r/2, then 9' is larger than 9, corresponding to the fact that points of the celestial sphere 
located in the direction opposite to the boost undergo a dilation.) 


The important difference between this computation and the one based on Bondi co¬ 
ordinates is the fact that here we never needed to take a “large r” limit. The result (4.10) 
is valid locally, for any boosted observer detecting a light ray. This implies in particular 
that an accelerated observer looking in the direction of his/her acceleration should see a 
time-dependent contraction of the celestial sphere, just as mentioned above for the case 
of Han Solo and Chewbacca. 
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5 Conclusion 


Let us take a look back at what we have done. We have seen how the Lorentz group 
arises as the set of homogeneous coordinate transformations between inertial observers in 
Minkowski space-time. Since it consists of linear transformations, it can be represented 
in terms of matrices. We have then shown that (the maximal connected subgroup of) 
this matrix group is isomorphic to SL( 2 ,C)/Z 2 — a type of relation that also occurs in 
other space-time dimensions. As observed in section 3, the group SL(2, C)/Z 2 also arises, 
somewhat coincidentally, as the group of conformal transformations of the sphere. The 
question, then, was whether there exists a relation between the action of Lorentz trans¬ 
formations on space-time and that of Mobius transformations on a sphere. We answered 
this question positively, by showing that the difference between the celestial spheres seen 
by two inertial observers whose coordinates are related by a Lorentz transformation is 
precisely a conformal transformation. Finally, we used this relation to discuss the slightly 
unexpected optical phenomenon associated with boosts: we saw that an observer boosted 
in a given direction sees the stars of his/her celestial sphere being dragged towards that 
direction. This result is applicable, in particular, to the jump to lightspeed as seen from 
the cockpit of the Millenium Falcon. 

As emphasized in the introduction, the surprising aspect of the relation between 
Lorentz and conformal transformations is the fact that it links the action of a group 
on a four-dimensional space to its action on a two-dimensional manifold. Of course, 
from a mathematical viewpoint there is nothing wrong with that, but from an intuitive 
viewpoint it is not a priori obvious that such a connection has any physical meaning — 
i.e. that this relation can actually be seen in a concrete experiment, such as accelerating 
in a spaceship. The purpose of these notes was to unveil that meaning, which is well 
known in the literature but perhaps less well known to undergraduate students following 
a course in special relativity, group theory, or even general relativity. 

In fact, part of the motivation for these lectures was that the idea of relating some 
space to a lower-dimensional subspace is closely connected to certain recent developments 
in the study of (quantum) gravity, all encompassed under the general name of holography. 
Recall that a hologram is a two-dimensional surface that produces a three-dimensional 
image — such an optical device is typically found on credit cards or banknotes. The idea 
of holography in quantum gravity [21-25] roughly states that there exists a correspon¬ 
dence (and in certain regimes an actual equivalence) between a gravitational system in d 
space-time dimensions and some quantum theory living on a lower-dimensional subspace 
of the gravitational system — one says that the two theories are “dual” to each other. 
In particular, according to this idea, the four-dimensional world that we see around us 
might be a “hologram” of some lower-dimensional theory. This correspondence is moti¬ 
vated by countless computations matching quantities evaluated on the high-dimensional, 
gravitational side, to some other quantities evaluated on the low-dimensional side; the 
interested reader may consult the abundant literature on the subject. 

The modest result derived in these notes may be seen as a remnant of the holographic 
principle: we have shown that Lorentz invariance in four dimensions becomes conformal 
invariance in two dimensions upon focusing on celestial spheres. In fact, this feature 
is only part of a much larger construction, that is still under study today. Indeed, 
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it was shown in the sixties by Bondi, van der Burg, Metzner and Sachs [13,14] that 
the natural symmetry group of four-dimensional “asymptotically Minkowskian” space- 
times is an inhnite-dimensional extension of the Poincare group, known nowadays as the 
Bondi-Metzner-Sachs (BMS) group. The transformations of space-time generated by this 
group precisely act on “null inhnity”, the region r —)■ -|-cx) that we used in section 4 to 
dehne celestial spheres, and extend the natural action of Poincare transformations on that 
region. In the holographic context, the BMS group is to be interpreted as the symmetry 
group of the would-be (as yet conjectural) dual held theory; the latter, if it exists, is 
expected to be some version of a conformal held theory (recall the brief discussion of 
subsection 3.4), since Lorentz transformations are part of the symmetry group and act 
as conformal transformations on celestial spheres. BMS symmetry has recently been the 
focus of renewed interest, as it was shown that it can be extended to include arbitrary, 
local conformal transformations of the celestial spheres [15-17], and also that it is related 
to standard quantum held theory in Minkowski space through certain “soft theorems” that 
were known in a completely diherent language ever since the sixties [52] (see e.g. [18-20], 
references therein, and their follow-ups). Many more open problems remain to be settled, 
both regarding holography in general, and BMS symmetry in particular; the hope of the 
author is that addressing such questions may open the door to a deeper understanding 
of quantum gravity. 
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